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Don’t feed the trolls 


Patent abuse slows down research and innovation, and must be confronted. Delays to 
US legislation are not reassuring, but there has been some progress in the courts. 


for Rob Carlson. A consultant in Seattle, Washington, Carlson 

is part of a growing movement of biohackers who tinker with 
biotechnology in their garages. But when word reached the media 
several years ago that Carlson intended to commercialize his inven- 
tions, he was threatened with lawsuits from people who claimed to 
hold patents that covered the entire field of proteomics — the study 
of the proteins that make up cells and organisms. If Carlson did not 
pay a substantial settlement, the purported patent holders said, they 
would sue him for infringement. 

And so Carlson was introduced to the world of patent trolls, a pejora- 
tive term for people or organizations who file or license patents solely to 
use them to extort money from firms that infringe them. In 2012, patent 
trolls accounted for 62% of all patent legislation in the United States; 
in 2011, such cases cost companies US$29 billion. Attempts to rein in 
patent abusers are mounting. On 2 June, the Supreme Court issued a 
decision that tightens requirements for patent claims to be clear and 
unambiguous, potentially limiting the broad claims that foster abuse. 
US President Barack Obama has made tackling trolls a priority, and 
last December, the House of Representatives passed a bill proposing an 
‘Innovation Act, which included measures to counteract the problem. 

But on 21 May, Senator Patrick Leahy (Democrat, Vermont), chair of 
the Senate Judiciary Committee, announced that he was taking the bill 
off the committee's agenda. He said that nearly a year of hard work had 
failed to produce legislation that would temper trolls without harm- 
ing genuine patent holders. With Congress heading into an election in 
November, it is unlikely that the bill will be resurrected this year. 

The legislation’s demise highlights how hard it is to design patent 
policy that satisfies two large technological domains. Trolls mainly tar- 
get technology firms, in part because patents on software and business 
methods are often broad. Many technology companies already find 
patents to be a nuisance — particularly those that make broad claims 
on business methods or software (see Nature 509, 152-154; 2014). 

By contrast, the biotechnology and pharmaceutical industries hold 
their patents dear: intellectual-property protection can be important 
during the often lengthy struggle to win regulatory approval for a drug or 
genetically engineered crop. Trolls have not so far tended to trouble these 
industries, but that may change. Ina study released this year, researchers 
found dozens of university-held patents that could be deployed against 
bioscience companies, including some that cover methods to screen for 
or manufacture new drugs (R. Feldman and W. N. Price UC Hastings 
Research Paper No. 93 http://doi.org/s2m; 2014). 

Universities also value patents, both to encourage the commerciali- 
zation of academic inventions and as a source of revenue. Academic 
technology-transfer offices tend to raise the bulk of their funds from 
licensing biomedical patents. In April, the Association of Univer- 
sity Technology Managers joined groups including the Biotechnol- 
ogy Industry Organization in signing a letter to Leahy, opposing 


N= long after news of his experiments got out, the trolls came 


the proposed Innovation Act. The bill would make it difficult and 
expensive to enforce their patents, they said. 

University opposition to the Innovation Act has fuelled claims that 
some academic institutions have themselves become patent trolls. 
Industry insiders have made this assertion many times over the years, 
particularly as universities have become more aggressive in protecting 
their patent holdings by suing potential infringers. A popular term 

for a patent troll is a ‘non-practising entity’: 


“Nearly ayear a party that does not intend to market prod- 
of hard work ucts based on its patents. By this very broad 
failedtoproduce _ definition, universities — which license 
legislation that their patents instead of marketing products 
would temper directly — would be patent abusers. But of 
trolls without course they are not, as long as they hold their 
harming mission to help society above their drive to 
genuine patent bring in cash. Academic institutions need 
holders.” to make their priorities clear: the practice of 


licensing patents to trolls to raise funds does 
not help their public image (see Nature 501, 471-472; 2013). 

It is disappointing that Congress will do nothing in the near future 
to slow the steady march of the patent troll. But luckily, legislation is 
not the only option. By the end of June, the Supreme Court is expected 
to rule in a complex patent case that could narrow the scope of soft- 
ware and business-methods patents (see Nature 507, 410-411; 2014). 
The US Patent and Trademark Office has initiatives to make it easier to 
determine who owns a patent. And the US Federal Trade Commission 
is studying troll behaviour. If the target is better defined, it may well 
become easier to design legislation that hits the mark. m 


Renewed energy 


Reforms at the US Department of Energy are 
recharging research. 


hen physicist Steven Chu took over as head of the US 
W/ Paves of Energy (DOE) in 2009, he vowed to reform 

its research culture. Many felt that the department had 
become much too bureaucratic — too rigid, too unresponsive to new 
opportunities, too divided into disciplines and too isolated from the 
needs of the marketplace. 

The following year, Chu launched five Energy Innovation Hubs 
intended to mimic the research style that he remembered from his time 
working at the AT&T Bell Labs in Murray Hill, New Jersey. Each hub 
would focus on a well-defined challenge in the area of renewable energy 
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—a top priority for the then-new administration of US President Barack 
Obama. It would bring together all the necessary expertise, from basic 
and applied research to engineering and early product development. 

Four years later, there is justified, if cautious, optimism about the out- 
come of Chu’s experiment. Viewed purely as research projects, most of 
the hubs seem to be doing well. In the next few months, the Joint Center 
for Artificial Photosynthesis, headquartered at the California Institute of 
Technology in Pasadena, hopes to demonstrate a first-generation pro- 
totype of an ‘artificial leaf’ — a cheap, robust and highly efficient system 
able to make liquid fuels out of sunlight, air and water (see page 22). The 
Joint Center for Energy Storage Research, headquartered at the DOE's 
Argonne National Laboratory near Chicago in Illinois, is likewise mak- 
ing good progress towards its goal: devices that can store much more 
electricity in much less space than the current champions, lithium-ion 
batteries (see Nature 507, 26-28; 2014). 

Only one of the five hubs has fallen by the wayside. The Energy 
Efficient Buildings hub, headquartered in Philadelphia, Pennsylvania, 
was eventually judged to be too diffuse in its goals for DOE purposes, 
and too oriented towards trying to get people to use currently avail- 
able technology. But it still exists. In April it took a new name — the 
Consortium for Building Energy Innovation — and relaunched itself 
as an independent research and demonstration centre. 

There are also grounds for optimism about the hubs larger purpose 
of transforming the DOE research culture — although in this case, the 
progress is less clear-cut. In some ways the agency is as bureaucratic 
as ever. And talk of change within the department has provoked its 
share of resistance from individuals who feel that their programmes 
are threatened. 

Nevertheless, there is considerable excitement in the DOE — a sense 
of new opportunities, new ventures, new people. The hubs are respon- 
sible for some of that feeling, as are innovations such as the Advanced 
Research Projects Agency — Energy (ARPA-E), established in 2009 
to fund speculative, high-risk, high-reward investigations, and a net- 
work of Energy Frontier Research Centers, launched the same year to 
promote cutting-edge basic research. 

Butat least as important is the sense that the people at the top under- 
stand and support reform. Chu’s initiatives have been continued by 


his successor, physicist Ernest Moniz — who last year told Congress 
that the hubs would be a good model for reforming the DOE's network 
of 17 national laboratories. Last month, Moniz appointed a panel to 
review the national labs, with a report due early next year. 

Obama’ administration has been supportive. In both his 2013 and 
2014 State of the Union addresses, Obama called for a US$1-billion 
National Network for Manufacturing Innovation. An interagency 

programme modelled in part on the DOE's 


“There is energy hubs, this would comprise 15 or more 
considerable centres looking to cut the energy, time and 
excitement in materials required to make things. The goal 
the Department __ istohelp US industries to compete with low- 
of Energy —a cost factories in emerging nations such as 
sense of new China, and to make it easier for start-up com- 
opportunities, panies — including many renewable-energy 
new ventures, firms — to bring new products to market. 


Congress has not yet acted on this proposal, 
but the administration has established several 
centres using existing funds from the DOE and other agencies. 

Such efforts need to be supported and encouraged — especially by 
Congress, which holds the federal purse strings, and by the energy 
industry, which can tap vast amounts of cash for activities it perceives 
to be in its interest. And even here there is reason for optimism. 
Despite the ideological warfare that has riven Washington DC in 
recent years, both parties have generally endorsed the DOE's reform 
efforts. And industry leaders seem ready to work closely with research- 
ers to bring innovative products to market. One example is the Clean 
Energy Trust, a Chicago-based consortium of energy companies that 
supports renewable-energy start-ups. 

Congress and the Obama administration could greatly help this 
movement by reviving the idea of the Clean Energy Deployment 
Administration: a ‘green bank that would pool public and private 
money for large-scale investments in clean-energy infrastructure. 
The idea was proposed a few years ago, but abandoned amid budget 
wrangles. Now that the federal deficit is easing and the economy has 
begun to improve, it could find renewed support on both sides of the 
aisle. The future, for once, is starting to look brighter. = 


new people.” 


Integrity mentors 


Policies in Ireland and China make Nature’s 
2014 mentoring awards timely. 


erfully worded statement Towards Excellence in Science 

(go.nature.com/pnhi9k). In encouraging a scientific culture 
of challenging the status quo, it includes a passage that speaks to 
laboratory behaviour: “To achieve scientific excellence, the sci- 
entific community needs to consciously advocate and uphold the 
scientific spirit, promote the value and focus of science in seeking 
truth and innovation, establish management structures and mecha- 
nisms that suit the characteristics and rules of scientific research, and 
discourage scientific behaviour aimed only at short-term success or 
individual benefits.” 

This week, the Irish Universities Association has issued a 
Concordat on research integrity, which includes mention of two 
aspects (among several) of scientific behaviour needing support: 
“reliability in performing research (meticulous, careful and atten- 
tive to detail), and in communication of the results (fair and full and 
unbiased reporting), and objectivity: interpretations and conclusions 
must be founded on facts and data capable of proof and secondary 
review; there should be transparency in the collection, analysis and 


| ast month the Chinese Academy of Sciences issued a pow- 
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interpretation of data, and verifiability of the scientific reasoning.” 

Such statements could all too easily be ignored unless they have 
teeth. In that spirit, readers might do well to focus on a clause ina 
document produced by Science Foundation Ireland (SFI), the 
country’s main funding agency and a collaborator on the integrity 
Concordat. On page 32 of its strategic plan, Agenda 2020, is the state- 
ment that research integrity will be scrutinized by external audits 
(go.nature.com/xjudiz). Congratulations to the SFI for showing more 
determination than most to back words with actions. 

Excellent science requires, not least, a capacity for researchers to be 
ruthlessly self-critical — in other words, assuring technical integrity. 
On discovering something interesting, they need to assume at the 
outset that they are deluded — that the combination of their object 
of study and their experimental, or theoretical or simulation set-up is 
conspiring to make them mistakenly believe that they have a startling 
new insight to offer an admiring world. They need to show their analy- 
ses or data to trusted but critically minded colleagues, in order to avoid 
mistakes and cherry-picking. Such a culture is best bred by tough but 
supportive laboratory mentors. In its annual mentoring awards, which 
has been held since 2005, Nature has rewarded outstanding mentors 
in many countries and regions. 

Given Ireland’s evident determination to sustain best practices, it 
is timely that this year’s mentoring competition 
is for scientists resident in that country and in 
Northern Ireland. Candidates need to be nomi- 
nated by past mentees using forms available at 
go.nature.com/hmezau. Deadline: 4 August. m 
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WORLD VIEW .cesicornon 


hen a team of cosmologists announced at a press 
W eosin in March that they had detected gravitational 

waves generated in the first instants after the Big Bang, the 
origins of the Universe were once again major news. The reported 
discovery created a worldwide sensation in the scientific community, 
the media and the public at large (see Nature 507, 281-283; 2014). 

According to the team at the BICEP2 South Pole telescope, the 
detection is at the 5-7 sigma level, so there is less than one chance 
in two million of it being a random occurrence. The results were 
hailed as proof of the Big Bang inflationary theory and its progeny, 
the multiverse. Nobel prizes were predicted and scores of theoretical 
models spawned. The announcement also influenced decisions about 
academic appointments and the rejections of 
papers and grants. It even had a role in govern- 
mental planning of large-scale projects. 

The BICEP2 team identified a twisty (B-mode) 
pattern in its maps of polarization of the cosmic 
microwave background, concluding that this was 
a detection of primordial gravitational waves. 
Now, serious flaws in the analysis have been 
revealed that transform the sure detection into 
no detection. The search for gravitational waves 
must begin anew. The problem is that other 
effects, including light scattering from dust and 
the synchrotron radiation generated by electrons 
moving around galactic magnetic fields within 
our own Galaxy, can also produce these twists. 

The BICEP2 instrument detects radiation at 
only one frequency, so cannot distinguish the cos- 
mic contribution from other sources. To do so, the BICEP2 team used 
measurements of galactic dust collected by the Wilkinson Microwave 
Anisotropy Probe and Planck satellites, each of which operates over 
a range of other frequencies. When the BICEP2 team did its analysis, 
the Planck dust map had not yet been published, so the team extracted 
data from a preliminary map that had been presented several months 
earlier. Now a careful reanalysis by scientists at Princeton University and 
the Institute for Advanced Study, also in Princeton, has concluded that 
the BICEP2 B-mode pattern could be the result mostly or entirely of 
foreground effects without any contribution from gravitational waves. 
Other dust models considered by the BICEP2 team do not change this 
negative conclusion, the Princeton team showed (R. Flauger, J. C. Hill 
and D.N. Spergel, preprint at http://arxiv.org/abs/1405.7351; 2014). 

The sudden reversal should make the scientific community con- 
template the implications for the future of cosmology experimentation 
and theory. The search for gravitational waves is 


not stymied. At least eight experiments, includ- DNATURE.COM 
ing BICEP3, the Keck Array and Planck, are _ Discuss this article 
already aiming at the same goal. online at: 

This time, the teams can be assured that the _go.niatuire.com/yhezeb 


THE INFLATIONARY 
PARADIGM IS 
FUNDAMENTALLY 


UNTESTABLE, 


AND HENCE 
SCIENTIFICALLY 


MEANINGLESS. 


Big Bang blunder bursts 
the multiverse bubble 


Premature hype over gravitational waves highlights gaping holes in models 
for the origins and evolution of the Universe, argues Paul Steinhardt. 


world will be paying close attention. This time, acceptance will require 
measurements over a range of frequencies to discriminate from fore- 
ground effects, as well as tests to rule out other sources of confusion. And 
this time, the announcements should be made after submission to jour- 
nals and vetting by expert referees. If there must bea press conference, 
hopefully the scientific community and the media will demand that it 
is accompanied by a complete set of documents, including details of the 
systematic analysis and sufficient data to enable objective verification. 

The BICEP2 incident has also revealed a truth about inflationary the- 
ory. The common view is that it is a highly predictive theory. If that was 
the case and the detection of gravitational waves was the ‘smoking gu’ 
proof of inflation, one would think that non-detection means that the 
theory fails. Such is the nature of normal science. 
Yet some proponents of inflation who celebrated 
the BICEP2 announcement already insist that the 
theory is equally valid whether or not gravitational 
waves are detected. How is this possible? 

The answer given by proponents is alarming: 
the inflationary paradigm is so flexible that it is 
immune to experimental and observational tests. 
First, inflation is driven by a hypothetical scalar 
field, the inflaton, which has properties that can 
be adjusted to produce effectively any outcome. 
Second, inflation does not end with a universe 
with uniform properties, but almost inevitably 
leads to a multiverse with an infinite number of 
bubbles, in which the cosmic and physical prop- 
erties vary from bubble to bubble. The part of the 
multiverse that we observe corresponds to a piece 
of just one such bubble. Scanning over all possible bubbles in the multi- 
verse, everything that can physically happen does happen an infinite 
number of times. No experiment can rule out a theory that allows for 
all possible outcomes. Hence, the paradigm of inflation is unfalsifiable. 

This may seem confusing given the hundreds of theoretical papers 
on the predictions of this or that inflationary model. What these papers 
typically fail to acknowledge is that they ignore the multiverse and 
that, even with this unjustified choice, there exists a spectrum of other 
models which produce all manner of diverse cosmological outcomes. 
Taking this into account, it is clear that the inflationary paradigm is 
fundamentally untestable, and hence scientifically meaningless. 

Cosmology is an extraordinary science at an extraordinary time. 
Advances, including the search for gravitational waves, will continue 
to be made and it will be exciting to see what is discovered in the com- 
ing years. With these future results in hand, the challenge for theorists 
will be to identify a truly explanatory and predictive scientific para- 
digm describing the origin, evolution and future of the Universe. m 


Paul Steinhardt is professor of physics at Princeton University. 
e-mail: steinh@princeton.edu 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


Tumour reined in 
by its neighbours 


Two groups have found that 
certain non-cancerous cells 
that surround a tumour in the 
pancreas keep it in check, even 
though these cells seem to 
boost other types of cancer. 

Raghu Kalluri at the 
University of Texas MD 
Anderson Cancer Center in 
Houston and his colleagues 
deleted fibroblast cells, which 
are found in connective tissue, 
in a mouse model of pancreatic 
cancer. These animals had 
more aggressive tumours and 
died sooner than those with 
fibroblasts. These cells seem 
to help the immune system to 
fight the tumour. In human 
pancreatic tissue samples, those 
with fewer fibroblasts were 
associated with lower patient 
survival rates. 

Kenneth Olive at Columbia 
University in New York, Ben 
Stanger at the University of 
Pennsylvania in Philadelphia 
and their colleagues also 
found that in animals that had 
fewer connective tissue cells 
in their pancreatic tumours, 
the tumours grew faster and 
had more blood vessels than in 
control mice. 

The results could help to 
explain why certain drugs that 
target a tumour’s neighbouring 
cells have failed in pancreatic 
cancer clinical trials, the 
authors say. 

Cancer Cell http://doi.org/szg; 
http://doi.org/szh (2014) 


Fetal cells havea 
good memory 


Immune cells that 
‘remember’ past encounters 
with foreign molecules 

have been found in human 
umbilical cord blood, even 
though the fetal environment 


Nectar feast in fossil belly 


A 47-million-year-old fossil bird with pollen grains in its belly 
is the first direct evidence of nectar-feeding in birds. 

Gerald Mayr and Volker Wilde at the Senckenberg Research 
Institute in Frankfurt, Germany, analysed the fossil of a small 
bird, Pumiliornis tessellatus (pictured), which is similar to 
cuckoos or parrots. Its thin, long beak resembles those of 
other nectar-slurping birds, such as hummingbirds. Electron 
microscopy revealed pollen grains in the bird’s stomach 


(boxed). 


P tessellatus is not closely related to present-day nectar- 
feeding birds, suggesting that interactions between birds and 
flowers predate those species, the authors say. 


Biol. Lett. http://doi.org/szj (2014) 


is presumed to be sterile. 
Human T cells begin to form 
when the fetus is about ten 
weeks old, but the cells were 
thought to remain unexposed 
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to foreign antigens until after 
birth. However, Richard 
Lo-Man of the Pasteur Institute 
in Paris and his team founda 
subset of fetal T cells known as 
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memory cells, which remember 
foreign molecules and respond 
more rapidly to them during 

a second exposure. These 
memory cells make up 1-6% 
of CD4, or helper, T cells in 
healthy neonatal cord blood. 
The identities of the antigens 
they react with are unknown. 
Sci. Transl. Med. 6, 238ra72 
(2014) 


Antimatter passes 
charge test 


Physicists have tightened the 
limits on the possible charge of 
an atom of antihydrogen. 

The neutral charge on many 
atoms and molecules has been 
measured with extremely high 
precision. The standard model 
of physics says that hydrogen’s 
antimatter counterpart should 
have an opposite charge and 
so be neutral to a similar level. 
Any differences between the 
two could help to explain why 
the Universe contains more 
matter than antimatter. 

Joel Fajans at the University 
of California, Berkeley, and 
his colleagues used data 
from previous experiments 
to analyse the influence of 
electric fields on antihydrogen 
atoms released from a 
magnetic trap. They found 
that the atom was charge- 
neutral, with a limit 1 million 
times lower than the best 
previous figure. 

Nature Commun. http://dx.doi. 
org/10.1038/ncomms4955 
(2014) 


Ancient coral reefs 
protected fish 


As the climate fluctuated over 
the past 3 million years, stable 
coral reefs provided a safe 
haven for tropical fish species, 
leading to the vast diversity of 
reef fishes seen today. 
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David 
Mouillot at 
Montpellier 
University 
in France and 
his colleagues used 
sediment cores to map 
coral reef habitats over the 
past 3 million years and 
compared this analysis with 
the current distribution of 
more than 6,000 reef fish 
species. The team found that 
the historical proximity of the 
fish habitats to stable coral 
reefs during cold periods 
had a greater influence on 
current patterns of reef fish 
diversity than present-day 
environmental factors such 
as sea-surface temperature. 
The diversity of damselfishes 
(Pomacentridae, pictured) in 
particular was strongly linked 
to their distance from coral 
reef refuges. 

As the climate warms, 
conservation biologists 
should focus on protecting 
coral reefs that connect 
historical refuges. 

Science 344, 1016-1019 (2014) 


First crust looked 
like Iceland 


Ancient rocks from Canada 
suggest that Earth’s first 
land was made billions of 
years ago in the same way as 
Iceland. 

Geologists think that 
continental crust first formed 
as magma broke through 
the dense ocean crust and 
crystallized on top. Iceland 
is considered to be the best 
modern analogue of early 
continental crust formation, 
but until now no ancient rocks 
have been found with the 
same chemistry as Icelandic 
ones. Jesse Reimink and his 
colleagues at the University 
of Alberta in Edmonton, 
Canada, discovered that 
4.02-billion-year-old rocks 
in northwestern Canada 
share the same geochemical 
signatures as rocks from 
Iceland, but have signatures 
that are distinct from 
3.9-billion- to 2.5-billion- 
year-old rocks. 


The Canadian 
rocks were 

probably formed as 
magma cooled and 
solidified in shallow waters, 
the scientists say. 
Nature Geosci. http://doi.org/ 
szb (2014) 


Brain signals move 
paralysed hand 


Paralysed monkeys were able 
to move their hands after 
researchers rerouted brain 
signals into the animals’ spinal 
cord. 

Jonas Zimmermann and 
Andrew Jackson at Newcastle 
University, UK, implanted 
recording electrodes in the 
premotor cortex area of the 
brains of two monkeys and 
stimulating electrodes in the 
monkeys spinal cords. The 
authors trained the animals to 
pulla lever for a food reward 
and then temporarily paralysed 
them by injecting a chemical 
into the motor cortex. 

When the paralysed 
monkeys tried to pull the lever, 
a computer program translated 
the electrical impulses from 
the brain into stimulation 
signals that were transmitted 
to the spinal cord. The animals 
could move the lever a greater 
distance and more frequently 
when the signals were 
transmitted compared with 
when they were not. 

The approach could one 
day be used to make paralysed 
human limbs perform a variety 
of tasks, the authors say. 

Front. Neurosci. 8, 87 (2014) 


ATMOSPHERIC SCIENCE 


Climate concerns 
from refrigerants 


Chemicals that are slowly 
escaping from refrigerators 
and air conditioners area 
greater threat to the climate 
than previously thought. 
Hydrofluorocarbons 
(HFCs) are refrigerants that 
are replacing banned ozone- 
depleting chemicals. But 
HFCs are also greenhouse 
gases, and the amount of 
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SOCIAL SELECTION 


Popular articles 
on social media 


Winner takes all in science 


Many researchers on Twitter have been sharing an article 
that supports their hard-won observations: science funding 
doesn't always go to the most deserving. The article, by 
sociologist Yu Xie at the University of Michigan in Ann 
Arbor, notes that science now resembles a “winner-takes-all” 
system that awards a disproportionate share of resources 

to a minority of researchers and institutions. Timothy 
O'Leary, a neuroscientist at Brandeis University in Waltham, 
Massachusetts, tweeted that “inequality is OK provided the 
90% can make a living”. But even that is proving difficult. 
“Postdocs are having permanent conversations about making 
the transition out of science,’ he says. “They have to move up 


or move out.” 


Xie, Y. Science 344, 809-810 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


these chemicals stored in 
refrigeration equipment is 
increasing. Guus Velders at the 
National Institute for Public 
Health and the Environment 
in Bilthoven, the Netherlands, 
and his colleagues analysed 
different scenarios for reducing 
HFCs, either by phasing out 
production or by destroying 
HFC-containing equipment. 

They found that phasing out 
production of HFCs early, such 
as by 2020, would provide the 
greatest environmental benefit. 
If the phase-out were to occur 
later, millions of refrigeration 
and air-conditioning units 
would need to be destroyed to 
achieve the same benefit. 
Atmos. Chem. Phys. 14, 
4563-4572 (2014) 


CLIMATE CHANGE 


Fading winter fog 
threatens crops 


Climate change has reduced 
the amount of winter fog in 
California (pictured), possibly 
damaging the region's fruit and 
nut crops. 

Much of the fruit and nut 
production in the United 
States comes from the Central 
Valley of California. Dennis 


NATURE.COM 
For more on 
popular papers: 
go.nature.com/f79a6z 


— 
—— 


Baldocchi and Eric Waller at 


the University of California, 
Berkeley, analysed 33 years 

of satellite data and found 

that the number of winter fog 
events in this region decreased 
by 46% on average over 

32 winters. 

Winters with less fog will 
yield fewer hours of winter 
chill, which fruit and nut 
trees need in order to rest and 
prepare for the next growing 
season. 

Geophys. Res. Lett. http://doi. 
org/szk (2014) 
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SEVEN DAYS nescnnss 


Misconduct update 
Haruko Obokata, a stem- 
cell researcher at the RIKEN 
Center for Developmental 
Biology in Kobe, Japan, has 
agreed to retract one of her 
two controversial research 
papers, according to media 
reports in Japan on 28 May. 
In the papers published in 
Nature in January, Obokata 
claimed that she had created 
anew type of stem cell using 
simple exposure to acid or 
physical stress. But the work 
was found to have included 
manipulated and duplicated 
images, and in April a 
RIKEN investigation 

found Obokata guilty of 
misconduct. See go.nature. 
com/wsfox5 for more. 


Weight issues 
Obesity is on the rise 
throughout the world, 
researchers announced on 

28 May. Data from more 

than 19,000 individuals in 

188 countries show that the 
prevalence of overweight and 
obese people rose by 27.5% 
for adults and by 47.1% for 
children between 1980 and 
2013. Almost two-thirds of 
obese people live in developing 
countries. The study found 
no countries in which obesity 
rates had fallen significantly 
(M. Ng et al. Lancet http://doi. 
org/szv; 2014). 


1Q on trial 


Some defendants with 
intelligence-quotient (IQ) 
scores of more than 70 may 
now escape execution on the 
basis of intellectual disability, 
after a US Supreme Court 
ruling on 27 May. The ruling 
came in favour of Freddie Lee 
Hall, a convicted murderer 

in Florida who has spent 

35 years on death row. Federal 
law prohibits the execution of 


Cleaner power in prospect 


Existing US power plants will have to cut 
their carbon dioxide emissions by 30% below 
2005 levels by 2030 under a plan from the 
Environmental Protection Agency (EPA) 
released on 2 June. The proposal will help to 
steer the United States towards reducing total 
greenhouse-gas emissions by 17% below 2005 
levels by 2020, to meet a 2009 pledge from 
President Barack Obama. Power plants (such 


intellectually disabled people, 
defined under Florida law as 
an IQ of less than 70. In his 


latest test, Hall had an IQ of 71. 


The Supreme Court affirmed 
the position of psychiatrists 
that IQ tests have a ten-point 
margin of error. The case now 
returns to Florida's court for a 
reassessment. See go.nature. 
com/xiiitc for more. 


EU clinical trials 
New rules governing clinical 
trials across Europe became 
law on 27 May when they 
were published in the Official 
Journal of the European 
Union. The rules were 
overhauled to simplify how 
trials are approved, including 
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harmonizing application 
procedures across all 

28 member states, and include 
an obligation to disclose 
research results. They will 
come into force in mid-2016. 


Patent clarity 

The US Supreme Court 
moved to rein in ambiguous 
patents in a ruling on 2 June. 
The decision relates to a 

case concerning a patent on 
heart-rate monitors used 
during exercise. A lower court 
had ruled that the patent was 
valid despite being somewhat 
ambiguous. The Supreme 
Court said that the lower court 
should have higher standards 
for patent clarity. The move 
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as the Bruce Mansfield Plant in Pennsylvania, 
pictured) are the single largest source of carbon 
pollution in the country. Regulations exist to 
limit emissions of pollutants such as arsenic 
and mercury, but there are no national controls 
on carbon-emission levels. The EPA expects to 
finalize the plan by June 2015, after a 120-day 
period for public comment. See go.nature.com/ 
gjyruv for more. 


could have implications 

for ‘patent trolls, who take 
advantage of ambiguous 
patents to sue companies for 
infringement. See page 7 

for more. 


Rights to therapy 


Patients do not have an 
automatic right to try a 
compassionate therapy when 
there is no scientific evidence 
that it works, according to 

a landmark ruling of the 
European Court of Human 
Rights in Strasbourg, France. 
The 28 May ruling referred to 
the case of Nivio Durisotto, 
who wanted his daughter, 
who has a degenerative brain 
disease, to be treated with 
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a controversial stem-cell 
therapy offered by the Stamina 
Foundation in Brescia, Italy. 
Stamina’s president Davide 
Vannoni — who is facing 
charges of fraud — had 
encouraged patients to appeal 
for the right to treatment 

on compassionate grounds. 
See go.nature.com/xqdpfb 
for more. 


GM compromise 
The European Union (EU) 
reached a compromise on 
the cultivation of genetically 
modified (GM) crops, in 

a closed-door meeting on 

28 May. EU member states 
are currently supposed to 
allow cultivation of any 

GM crop that the European 
Food Safety Authority 
deems safe for health and 
the environment. But some 
countries have refused, 
creating a constitutional 
stand-off. Now, EU member 
states will be allowed to opt 
out of growing GM crops on 
non-scientific grounds, such 
as issues relating to town and 
country planning. European 
environment ministers 

will vote on the proposal 

on 12 June; it also requires 
parliamentary approval. 


Kavli prizes 


Nine scientists shared the 
three 2014 biennial Kavli 
prizes (worth US$1 million 


TREND WATCH 


This year, the United States 
has already experienced the 
greatest number of measles 
cases for 20 years, researchers 
from the Centers for Disease 
Control and Prevention in 
Atlanta, Georgia, announced 


last week. The country declared 
the disease eliminated in 2000, 

and more than 90% of its young 
population are vaccinated. The 


most likely reason for the rise, the 


researchers say, is that the virus is 
being imported by unvaccinated 
US travellers who then mingle 
with other unvaccinated people. 


each), announced on 29 May. 
Alan Guth, Andrei Linde 
and Alexei Starobinsky won 
in astrophysics, for their 
work developing the theory 
of cosmic inflation. The 


nanoscience prize went to 
Thomas Ebbesen, Stefan Hell 
and John Pendry, for studies 
defeating the ‘resolution 
limit’ of optical imaging 

and microscopy — showing 
that light can interact with 
nanostructures smaller than 
the light’s wavelength. In 
neuroscience, Brenda Milner 
(pictured), John O’Keefe 
and Marcus Raichle won for 
pinpointing brain regions 
involved in memory and the 
key functions of specialized 
nerve cells. See go.nature.com/ 
ifevbf for more. 


Shaw prizewinners 
The Shaw Prize Foundation 
in Hong Kong announced 

the winners of its annual 
prizes on 27 May. Worth 
US$1 million, the prizes 


MEASLES SPIKE 


are awarded in the areas of 
astronomy; mathematical 
sciences; and life science and 
medicine. The winners include 
Daniel Eisenstein of Harvard 
University in Cambridge, 
Massachusetts, for his work 
on waves originating in the 
early Universe called baryonic 
acoustic oscillations, and 
Kazutoshi Mori of Kyoto 
University in Japan for his 
part in discovering a key 
mechanism involved in 
degenerative diseases. 

See go.nature.com/vrpwla 

for more. 


| FUNDING 
Stem-cell petition 


The European Commission 
has rejected a petition for 
legislation to ban the funding 
of research that uses human 
embryonic stem cells. The 
petition, signed by 1.7 million 
European citizens, came from 
a pro-life organization based 
in Brussels called One of Us. 
On 28 May, the commission 
said that stem-cell research 
held great promise and it was 
therefore in the public interest 
to support it. It also agreed that 
human embryonic stem cells 
are still sometimes required in 
such research. See go.nature. 
com/aaétuv for more. 


Brain grants 

The first round of awards 
from a large US multi-agency 
funding effort to study the 


US public-health experts urge vaccination after largest outbreak 


in 20 years. 
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8-13 JUNE 
Geoscientists will meet 
at Goldschmidt2014 in 
Sacramento, California. 
Up for discussion are 
the potential habitability 
of Mars, and the 
environmental legacy of 
California’s gold rush. 
go.nature.com/o8iznc 


brain have gone to work 

on neuropsychological 
disorders. On 27 May, the 
Defense Advanced Research 
Projects Agency in Arlington, 
Virginia, awarded the 
University of California, San 
Francisco, US$26 million 

and Massachusetts General 
Hospital in Boston $30 million 
over five years. The projects 
are funded under the Brain 
Research through Advancing 
Innovative Neurotechnologies 
(BRAIN) initiative. They will 
focus on creating devices that 
stimulate areas deep in the 
brain and record brain activity 
for research on soldiers and 
veterans with conditions 

such as post-traumatic stress 
disorder. 


| _BUSINESS 
Sequencing boost 


Swiss drug company Roche, 
based in Basel, moved to 
boost its genetic-sequencing 
business by acquiring Genia 
Technologies of Mountain 
View, California. Roche 
announced on 2 June that it 
will pay up to US$350 million 
for Genia, which is developing 
a semiconductor-based DNA- 
sequencing technology that 
uses nanopores to read single 
genetic molecules. Roche 
wants to use sequencing in 

its diagnostics business. In 
2012, it unsuccessfully tried 
to buy Illumina of San Diego, 
California, which owns most 
of the global sequencing 
market. 
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Bacteriophages could be a resource for fighting drug- 


MICROBIOLOGY 


resistant bacterial infections. 


Phage therapy 
gets revitalized 


The rise of antibiotic resistance rekindles interest in 


acentury-old virus treatment. 
BY SARA REARDON 


Curtain were denied access to some of 

the best antibiotics developed in the 
West. To make do, the Soviet Union invested 
heavily in the use of bacteriophages — viruses 
that kill bacteria — to treat infections. Phage 
therapy is still widely used in Russia, Georgia 
and Poland, but never took off elsewhere. “This 
is avirus, and people are afraid of viruses,” says 
Mzia Kutateladze, who is the head of the sci- 
entific council at the Eliava Institute in Tbilisi, 
which has been studying phages and using 


F« decades, patients behind the Iron 


them to treat patients for nearly a century. 

Now, faced with the looming spectre of 
antibiotic resistance, Western researchers and 
governments are giving phages a serious look. 
In March, the US National Institute of Allergy 
and Infectious Diseases listed phage therapy 
as one of seven prongs in its plan to combat 
antibiotic resistance. And at the American 
Society for Microbiology 


(ASM) meeting in Bos- NATURE.COM 
ton last month, Grégory For more on 
Resch of the University antibiotics and 

of Lausanne in Switzer- _ phage therapy, see: 
land presented plans for —_go.nlature.com/pykvdd 
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Phagoburn: the first large, multi-centre clini- 
cal trial of phage therapy for human infections, 
funded by the European Commission. 
Ryland Young, a virologist at Texas A&M 
University in College Station, attributes the 
previous lack of Western interest to clinicians’ 
preference for treating unknown infections 
with broad-spectrum antibiotics that kill 
many types of bacterium. Phages, by contrast, 
kill just one species or strain. But researchers 
now realize that they need more precise ways 
to target pathogenic bacteria, says microbiolo- 
gist Michael Schmidt of the Medical University 
of South Carolina in Charleston. Along with 
the rising tide of strains resistant to last-resort 
antibiotics, there is growing appreciation 
that wiping out the human body’s beneficial 
microbes along with disease-causing ones can 
create a niche in which antibiotic-resistant bac- 
teria can thrive. “Antibiotics are a big hammer,’ 
Schmidt says. “You want a guided missile.” 
Finding a phage for a bacterial target is 
relatively easy, Young says. Nature provides an 
almost inexhaustible supply: no two identical 
phages have ever been found. As a bacterium 
becomes resistant to one phage — by shedding 
the receptor on the cell surface that the virus 
uses to enter — the Eliava Institute researchers 
simply add more phages to the viral cocktails 
that patients receive. Kutateladze says that they 
update their products every eight months or 
so, and do not always know the exact combina- 
tion of phages that make up the cocktail. 
Resch, who is one of the leaders of the Phago- 
burn study, says that regulatory agencies would 
need to figure out how to oversee such a rapidly 
evolving product before the therapy could pro- 
gress beyond clinical trials. He hopes that phage 
therapy will be treated in a similar way to the 
seasonal influenza vaccine, for instance, which 
is updated every year as new flu strains emerge. 
The fact that the European Union (EU) is 
contributing €3.8 million (US$5.2 million) to 
the Phagoburn study shows that it is willing to 
consider the approach, Resch says. Beginning 
in September, researchers in France, Belgium 
and the Netherlands plan to recruit 220 burn 
victims whose wounds have become infected 
with the common bacteria Escherichia coli or 
Pseudomonas aeruginosa. The patients will 
be given phage preparations from a company 
in Romainville, France, called Pherecydes 
Pharma, which has isolated more than 1,000 
viruses from sources such as sewage or river 
water and screened them for the ability to kill > 
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> pathogenic bacteria. To lower the chance 
that resistance will develop, the patients 
will receive a cocktail of more than a dozen 
phages that enter bacterial cells in different 
ways. If the phage treatment fails, patients 
will then receive standard antibiotics. 

Although governments are starting to 
pay attention to phage therapy, pharmaceu- 
tical companies remain reluctant to get on 
board, Young says. Because phage therapy 
is nearly a century old, it would be diffi- 
cult for a company to claim a treatment as 
intellectual property, and therefore recoup 
its costs. Young says it is likely that a 2013 
ruling by the US Supreme Court against the 
patenting of natural genes would also apply 
to phages isolated from nature. Jéréme 
Gabard, chief executive of Pherecydes, 
says that the company is banking on hopes 
that developing and characterizing precise 
combinations of natural phages to target 
particular bacteria will be patentable. 

Anengineered phage could, in theory, be 
patented. At the ASM meeting last month, 
researchers led by synthetic biologist Tim- 
othy Lu at the Massachusetts Institute of 
Technology in Cambridge presented work 
ona phage engineered to use a DNA-editing 
system called CRISPR to kill only antibiotic- 
resistant bacteria. The phage injects the 
bacterium with DNA, which the microbe 
transcribes into RNA. If part of the bacte- 
rium’s antibiotic-resistance gene matches 
that RNA sequence, an enzyme called Cas9 
cuts up the cell’s DNA, killing it. 

In initial trials, the researchers found that 
their phage could kill more than 99% of 
the E. coli cells that contained specific anti- 
biotic-resistance gene sequences, whereas it 
left susceptible cells alone. Giving the phage 
to waxworm larvae infected with resistant 
E. coli increased the worms’ chance of sur- 
vival. The researchers are now starting to 
test the system in mice (human trials are a 
long way off). 

Gabard does not expect that phage 
therapy will ever replace antibiotics. But he 
says that he can envisage regulatory agen- 
cies approving it for patients in whom drug 
treatments have failed. And some people 
with antibiotic-resistant infections are 
taking matters into their own hands. Kutate- 
ladze says that an increasing number of EU 
patients are travelling to Georgia for phage 
treatment. She adds that doctors in some EU 
countries send patients’ samples to the Eli- 
ava Institute, which then sends back a phage 
cocktail specific to the bacterium causing 
the infection. “When there’ no hope, you'll 
do anything,’ Schmidt says. 

Meanwhile, researchers are watching the 
Phagoburn study with interest, hoping that 
it will lay the groundwork for moving the 
technology into the clinic. “We just need 
one really great success for the field to really 
open up, says Lu. m 
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Land models put 
to climate test 


Study under way on Mongolian steppes aims to improve 
knowledge of warming effects on vegetation. 


BY JANE Qiu 


See Wan remembers his first 


experience of a major sandstorm more 
than a decade ago in Inner Mongolia. “It 
was like sands being dumped on me bya gigan- 
tic dustbin,’ he says. “I couldn't see anything 
just a few metres away.’ Decades of overgrazing 
were turning the region into desert. 
Grassland is now developing there again 
after strict grazing limits were imposed on 
the autonomous region by China in 2000. 
But Wan, an ecologist at Henan University 
in Kaifeng, worries that another, more chal- 
lenging, menace will eat away at vegetation 
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there and elsewhere: climate change. And if 
grasslands wither on a global scale, “it would 
not only cause widespread desertification, 
but accelerate climate warming by increasing 
carbon dioxide levels in the air’, he says. 

Yet scientists know little about the effects 
of climate change on land ecosystems, or how 
they will affect atmospheric CO). 

Wan and his colleagues have an ambitious 
plan for filling that information vacuum. In 
a US$260,000 operation, they have cut 54 
6-tonne chunks of soil — 2.2 metres by 
1.5 metres, by 1.2 metres deep — from 
three types of grassland in the Mongolian 
Plateau, and are now growing them at the 
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Duolun Ecological Research Station in the 
region. Later this month, the team will start a 
decade-long experiment to test how artificially 
induced warming and varying amounts of 
rainfall affect vegetation growth, water, nutri- 
ents and plant composition. More crucially, it 
will investigate the underlying mechanisms. 

The importance of the experiment “goes 
beyond the Mongolian steppes’, says Alan 
Knapp, an ecologist at Colorado State Univer- 
sity in Fort Collins. “Grasslands across broad 
temperature and precipitation gradients can 
respond quite differently to climate change 
— in some cases, as differently as between a 
grassland and a forest.” 

Models that simulate how terrestrial eco- 
systems respond to climate change and affect 
atmospheric CO, levels fail to capture these 
nuances, Knapp told a meeting last month in 
Beijing. The meeting was funded by the US 
National Science Foundation with the aim of 
fostering closer ties between experimental- 
ists and modellers. The models used assume 
that a given type of vegetation will behave in 
the same way, regardless of the local climate. 
“Historically, ecologists tend to work in their 
backyard and assume everywhere else is the 
same,’ Knapp says. 

Not only that, but studies conducted at 
different locations are often not comparable 
because different teams address different ques- 
tions, and their experimental protocols also 
vary. For this and other reasons, few data from 
the hundreds of manipulation experiments 
conducted worldwide — at a cost of billions 
of dollars — have been incorporated into the 
models, says Yiqi Luo, a climate modeller at the 
University of Oklahoma in Norman. 

Until recently, “there has been limited cross- 
over between experimentalists and modellers’, 
says Jeff Dukes, an ecologist at Purdue Univer- 
sity in West Lafayette, Indiana. Typically, “we 
would turn to modellers years after an experi- 
ment has begun and then realize things that 
we should have measured are not measured”, 
adds Richard Norby, an ecologist at Oak Ridge 
National Laboratory in Tennessee. “We would 
have made a lot faster progress if the two com- 
munities had worked closely together from the 
very beginning.” 

The key to success lies partly in collecting 
the right data for the model from the start. 
Wan’s study will be among the first to 
do this, and is one of a growing number of 
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WORKING THE LAND 


Global-change experiments in China 
are studying how ecosystems react to 
warming, atmospheric carbon dioxide, 
precipitation and additional nitrogen. 


such ‘global-change’ experiments for land 
ecosystems (see “Working the land’). It will 
measure a range of factors that are needed to 
drive models, such as soil carbon content and 
nutrient level, plant biomass and productivity, 
and meteorological parameters. Results from 
the study could, for example, be the first step to 
making comparisons across a rainfall gradient, 
and so improve land models, Luo says. 

Wanis initiative will be welcome. The per- 
formance of land models “is rather poor at the 
moment’, says Philippe Ciais, a climate model- 
ler at the National Centre for Scientific Research 
in Paris. For instance, their predictions of CO, 

release from land eco- 


“Grasslands systems in response 
across broad to changes in atmos- 
temperature and _ pheric CO,, tempera- 
precipitation ture and other climate 
gradients can variables diverge 
respond quite wildly (V. K. Arora 
differently to et al. J. Clim. 26, 


5289-5314; 2013). 
Land models “are the 
weakest link in our projection of future climate 
change’, Ciais adds. 

To understand why the models differ so 
much, the US Department of Energy is fund- 
ing a project, led by Norby, to compare experi- 
mental data and land models. “Initially, we 
wanted to see which models fit the data best,’ 
says Norby. “But we soon realized that this isn’t 
the right approach because models can give the 


climate change.” 
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right answer for the wrong reason.” 

The team found, for instance, that most mod- 
els are flawed in their basic assumptions about 
how plants use nitrogen, but can still reproduce 
some of the effects seen in global-change experi- 
ments because errors in opposite directions can- 
cel each other out (S. Zaehle et al. New Phytol. 
http://dx.doi.org/10.1111/nph.12697; 2014). 
“This means that they are not going to be very 
powerful for predicting the future,’ says Norby. 

At the Beijing meeting, researchers iden- 
tified other weaknesses. The models are 
poor at capturing how vegetation composi- 
tion responds to climate change — a crucial 
factor for predicting changes in carbon stor- 
age in the long term. This is partly because the 
models tend to be based on the physiology of 
single plants and partly because they take scant 
account of plant diversity. 

To improve land models, researchers at the 
meeting plan to launch long-term initiatives to 
probe how ecosystems worldwide respond to 
climate change and to standardize experimen- 
tal designs and methods so that the results can 
be compared. 

The ecologists are also calling for a common 
repository for data from global-change experi- 
ments that can be accessed by modellers — 
similar to existing databases for atmospheric 
and oceanic observations. 

“There is a long way to go,’ says Luo. “To 
connect experimentalists and modellers is the 
key. The bridge has just started being built” m 
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REGENERATIVE MEDICINE 


Funding windfall rescues 
abandoned stem-cell trial 


But more players have joined the quest to treat spinal-cord injury with embryonic cells. 


BY ERIKA CHECK HAYDEN 


landmark stem-cell trial is sputtering 
A‘ to life two-and-a-half years after 

it was abandoned by the California 
company that started it. But it now faces a 
fresh set of challenges, including a field that is 
packed with competitors. 

The trial aims to test whether cells 
derived from human embryonic stem 
cells can help nerves to regrow in cases 
of spinal-cord injury. It was stopped 
abruptly in 2011 by Geron of 
Menlo Park, California (see 
Nature 479, 459; 2011); the 
firm said at the time that it 
wanted to focus on several 
promising cancer treatments 
instead. Now, a new company 
— Asterias Biotherapeutics, also 
of Menlo Park — plans to resur- 
rect the trial with a US$14.3-million 
grant that it received on 29 May from the 
California Institute for Regenerative Medicine 
(CIRM), the state’s stem-cell-funding agency. 

But the field has moved on since Geron 
treated its first patient in 2010, and the therapy 
that Asterias inherited is no longer the only 
possibility for spinal-cord injury. StemCells, a 
biotechnology company in Newark, Califor- 
nia, has treated 12 patients in a safety study 
of a different type of stem cell, and it plans to 
start a more advanced trial this year to test 
effectiveness. And another entrant to the 
field, Neuralstem of Germantown, Maryland, 
received regulatory approval in January 2013 
to begin human tests of its stem-cell product. 

Geron’s human trial was the first approved to 
use cells derived from human embryonic stem 
cells. But regulators halted it twice, once citing 
concerns about the purity and predictability of 
the cells being implanted, and again after the 
company reported seeing microscopic cysts in 
the spinal cords of rats that had been treated in 
preclinical studies. The worry was that the cysts 
could be teratomas — uncontrolled growths 
that can form from embryonic stem cells, a 
feared side effect of treatment. Geron later said 
that the growths were not teratomas, and the 
US Food and Drug Administration allowed the 
trial to proceed. But after injecting the cells into 
five of the ten intended patients, the company 
said that it had run out of money for the trial. 
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Embryonic stem cells may have the 
ability to repair damaged tissue. 


Geron founder Michael West and former 
chief executive Thomas Okarma then formed 
Asterias, which bought Geron’s stem-cell 
therapy last year. The company plans first to 
treat three patients with spinal-cord damage 
in the neck, using a low dose of the stem cells; 
it will then treat different people with higher 
doses to see if the therapy can restore any sen- 
sation or function in the trunk or limbs. 

The five patients previously treated by 
Geron, whom Asterias continues to track, had 
cord damage at chest level. On 22 May, Asterias 
reported that none of those five had experi- 
enced serious side effects from the treatment 
or developed immune responses to it. 

Researchers say that the continuation of the 
former Geron trial is important because it uses 
a type of cell different from the fetus-derived 
ones used by StemCells and Neuralstem. Geron 
surgically implanted embryonic stem cells that 
had been coaxed in vitro to grow into imma- 
ture myelinated glial cells, which insulate nerve 
fibres when mature. The other companies are 
using partially differentiated cells derived from 
fetal brain tissue, which might produce sub- 
stances that protect surviving tissue and make 
new connections in the neural circuitry. 

“It's very good for the field, because we now 
have multiple cell lines being tested in very simi- 
lar populations of patients, and this will help us 
define what is needed to make this approach 
work,” says Martin Marsala, a neuroscientist 
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at the University of California, San Diego, 
whose work has shown that Neuralstem’s cells 
can develop into working neurons and restore 
movement to rats with cord injuries in the neck. 

Still, some in the field question whether 
stem cells are a good candidate for treating 

paralysis. Spinal trauma can destroy a range 
of cells; it can be difficult to access the injury 
site; and damage can be very severe. Treat- 
ing paralysis “may be asking a lot more of 
the cells than they are capable of biologi- 
cally”, says Robert Lanza, chief scientific 
officer at Advanced Cell Technology in Marl- 
borough, Massachusetts, which is now testing 
embryo-derived stem cells in two eye diseases. 

Lanza says that his company opted not to 
buy Geron’s stem-cell therapy in part because 
the treatment did not seem effective enough 
in animal models. Some of the scientists who 
reviewed Asterias’s grant proposal to the 
CIRM also expressed concern about the treat- 
ment’s effectiveness in animals. “It would be 
fantastic if [the cells] work, but I think it would 
also be wise not to set expectations too high,’ 
says Lanza. 

Marsala counters that the cells do not have 
to heal the spinal cord completely to improve 
people’s lives. He notes that simply replacing 
injured or dead cells with healthy ones might 
prevent the damage from spreading. StemCells 
has reported that four people in its studies have 
recovered some sensation; two previously had 
no feeling in their lower limbs and another two 
had some but were able to feel more after the 
cell transplant. Marsala sees this as a sign that 
the therapy is working. “Nobody is claiming 
that you are going to see complete recovery in 
these patients,” he says. 

Christopher Scott, an ethicist at Stanford 
University in California, says that before 
scientists can judge the effectiveness of the 
therapies, the companies must complete their 
trials. But he wonders whether Asterias has suf- 
ficient resources to make good on its promise to 
carry Geron’s therapy through to completion. 

Scott notes that Geron spent hundreds of 
millions of dollars developing spinal-cord and 
other embryonic-stem-cell therapies, but man- 
aged to treat only five people. The restarted trial 
will face even greater scrutiny, and he worries 
that if it were halted, it would be a huge blow to 
patients. “Bioethicists and policy analysts will be 
watching carefully,’ he says. = 
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Research minister Johanna Wanka says planned changes will improve German universities. 


Federal boost for 
German science 


Agreement on changes to constitution pave way for 


university funding shake-up. 


BY QUIRIN SCHIERMEIER 


ermany’s highly federalized political 
Ga has been a pillar of democracy 
for more than halfa century. But even 
as the country’s constitution has underpinned 
its stability, it has been a thorn in the side of the 
scientific community. Universities are largely 
funded by local governments, and there are 
tight restrictions on the uses of federal fund- 
ing — for example, it cannot be used to pay 
permanent staff or infrastructure costs. This 
makes it difficult for institutions to compete 
with the world’s leading research powerhouses. 
But now, after years of discussions, Ger- 
man legislators have finally agreed to revise a 
paragraph in the constitution that prevents the 
federal government from providing permanent 
direct funding to universities. The move has 
pleased scientists and promises to pave the way 
for improved job security, lucrative research 
collaborations and possibly new centres. 
Education has traditionally been the respon- 
sibility of Germany’s 16 states, and they have 
been keen to maintain control. In 2006, for 
example, they signed up to a reform that abol- 
ished federal interference in higher education 
and increased their independence. This made it 
even more difficult for the federal government 
to contribute to the maintenance of the more 
than 100 research universities run by the states. 


There are exceptions: the federal govern- 
ment provides more than half of the €2.5-bil- 
lion (US$3.4-billion) annual budget of the 
DFG, Germany’s main grant-giving agency 
for university research (the rest comes mainly 
from the states). The federal science ministry 
also contributes 75% of a €4.6-billion excel- 
lence initiative, launched in 2006, that allows 
universities to compete for top-up funds to 
bolster neglected disciplines. Since its incep- 
tion, almost 100 graduate schools and large 
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research collaborations have been set up at 
39 universities with this money. In total, the fed- 
eral government pumps about €17 billion into 
the annual research budget, a figure dwarfed by 
the states’ €75 billion. Germany’s total research 
and development budget is roughly 2.9% of 
gross domestic product (see ‘Big spenders’). 

But a lack of permanent federal support 
creates problems. For example, the excellence 
initiative expires in 2017, prompting fears that 
projects will be discontinued and research- 
ers laid off. Scientists therefore applauded the 
planned constitutional change, which will allow 
the federal government to step in with perma- 
nent financial support for projects at risk. 

“We now have much more room to manoeu- 
vre, says Johanna Wanka, federal minister for 
education and research. “This is a great politi- 
cal success and it will lastingly improve condi- 
tions at German universities.” 

The changes, agreed last week and expected 
to come into effect before 2015, will end a six- 
month impasse over a €6-billion fund that the 
government last year promised to inject into 
science and education between 2014 and 2017. 
They will also enable a fundamental reform of 
German science, the heads of Germany’s large 
research organizations said in a joint statement. 

“The promised constitutional change is 
a true sea change that will finally enable the 
government to directly and permanently co- 
finance universities, says Wolfgang Marquardt, 
head of the German Science Council, which 
advises the government on science policy. 

Universities in Germany's poorer states, 
such as the University of Bremen, a past winner 
of an excellence initiative award, are particu- 
larly in need ofa steady flow of federal money. 
Gerold Wefer, a marine scientist at MARUM, 
the university’s centre for marine environ- 
mental sciences, which was set up with award 
funds, is now optimistic that the centre can 
survive beyond 2017. “Otherwise, dozens of 
scientists on fixed-term contracts might have 
to fear for their jobs,” he says. 

How any future cash from the federal gov- 
ernment will be used is yet to be worked out. 
The government has previously ruled out con- 
tinuing the excellence initiative in its current 
form. Instead, says Marquardt, the govern- 
ment could channel extra cash to the DFG, 
enabling the agency to fund existing or new 
collaborations on a competitive basis. Alter- 
natively, he says, the new money could be used 
to create federally funded university institutes 
or regional science centres to continue and 
expand successful research collaborations. For 
example, Wolfgang Herrmann, president of 
the Technical University of Munich and a long- 
time champion ofa German national institute 
of technology, has suggested that his university 
could become a federal technology hub. 

But Marquardt cautions that it is early days. 
“The changes agreed upon open up many pleas- 
ant possibilities,” he says. “We shall now start 
thinking about what might work best.” = 
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COSMOLOGY 


Big Bang finding challenged 


Signal of gravitational waves was too weak to be significant, studies suggest. 


BY RON COWEN 


he astronomers who earlier this year 
TL Pesnoxnee that they had evidence of 

primordial gravitational waves jumped 
the gun, two independent analyses suggest. 

The papers, published on the arXiv preprint 
repository, propose that the original analy- 
sis did not properly account for the 
confounding effects of galactic 
dust. Although further obser- 
vations may yet confirm 
the findings, independent 
researchers now say they 
no longer think that the 
original data constituted 
significant evidence. 

“Based on what we know 
right now, we have no evi- 
dence for or against gravitational 
waves, says UroS Seljak, a cosmolo- 
gist at the University of California, Berke- 
ley, and co-author of one of the latest studies’. 

Astronomers using the BICEP2 radio tele- 
scope at the South Pole announced in March 
that they had found a faint twisting pattern 
in the polarization of the cosmic microwave 
background (CMB), the radiation left over 
from the Big Bang. This pattern, they said, was 
evidence for primordial gravitational waves — 
ripples in the fabric of space-time generated 
in the Universe's first moments. The findings 
were widely hailed as confirmation of the 
theory of cosmic inflation, which holds that 
the cosmos ballooned in size during the first 
fraction of a second after the Big Bang. 

But the new analyses suggest that the twist- 
ing patterns in the CMB polarization could 
just as easily be accounted for by dust in the 
Milky Way’”. 

The papers follow a presentation three weeks 
ago by Raphael Flauger, a theoretical physicist 
at New York University and the Institute for 
Advanced Study in Princeton, who re-exam- 
ined a map of galactic dust used by BICEP2. 
Flauger concluded that the BICEP2 research- 
ers had probably underestimated the fraction of 
polarization caused by dust in the map, which 
was compiled from data from the European 
Space Agency's Planck spacecraft. Flauger says 
that when the dustis fully accounted for, the sig- 
nal that can be attributed to gravitational waves 
either vanishes or is greatly diminished. 

“Thad thought that the [BICEP2] result was 
very secure,” said Alan Guth, the cosmolo- 
gist who first proposed the inflation concept 
in 1980, after learning about Flauger’s talk. 
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“Now the situation has changed,’ added Guth, 
who works at the Massachusetts Institute of 
Technology in Cambridge. 

The BICEP2 researchers have argued that 
the Planck map figured in only one of the six 
models that they used to examine the role of 


Cross-checks of gravitational-wave results used 
this galactic-dust map from the Planck satellite. 


dust. But in a paper’ posted to the arXiv server 
on 28 May, Flauger and his co-authors David 
Spergel and Colin Hill, both of Princeton Uni- 
versity in New Jersey, say that the five other 
models are based on alow estimate — between 
3.5% and 5% — of the fraction of total polari- 
zation caused by galactic dust. Extrapolation 
from amore detailed map, released last month 
by the Planck team, suggests that the fraction 
is closer to 8-15%, Spergel explains. 

With those updated numbers, he says, 
“there's no evidence for the detection of gravi- 
tational waves”. Buta final determination can- 
not be made until a more precise dust map, 
expected to be released by the Planck team in 
October, is available, he adds. 

In the other analysis’, Seljak and Michael 
Mortonson, a cosmologist also at the Uni- 
versity of California, Berkeley, re-examined 
BICEP2 data on how the polarization signal 
varies with the frequency of the microwaves 
it detects. The BICEP2 team had checked its 
results against data recorded at lower fre- 
quency by an older telescope, BICEP1. They 
found that the intensity of polarization did 
not change from one frequency to the other in 
the way expected if it were caused by dust, and 
concluded that the data favoured gravitational 
waves over dust by an 11-to-1 margin. 

But Seljak and Mortonson say that the 
BICEP2 analysis did not exclude data on small 
spatial scales, or fractions of degrees of the sky. 
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That is a problem, Seljak says, because on these 
small scales, gravitational lensing — in which 
the path of light bends around massive objects 
— exactly mimics the twisting polarization 
pattern that gravitational waves imprint on 
larger spatial scales. 

Accounting for lensing, “the primordial 
gravity-wave signal is preferred to dust 
with odds of less than two to one — 
in other words, not significant 
odds at all,” says Seljak. 

BICEP2 co-leader James 

Bock, a physicist at the 
California Institute of 
Technology in Pasa- 
dena, says that although 
his group’s main paper 
“has been revised based on 
many referee comments and 
resubmitted” for publication, the 
evidence for gravitational waves “is 
certainly not being retracted”. The BICEP2 
results “are basically unchanged’; he says. 

Further observations may yet see the cosmic 
ripples emerge from the dust. It is possible that 
forthcoming data from several observatories 
— including the Keck Array, a telescope at the 
South Pole built by the BICEP2 team — and the 
Planck team’s full-sky map of CMB polariza- 
tion will confirm that a signal is there, although 
perhaps not as strong as first suggested. m 


1. Mortonson, M. M. & Seljak, U. Preprint at http:// 
arxiv.org/abs/1405.5857 (2014). 

2. Flauger, R., Hill, J.C. & Spergel, D. N. Preprint at 
http://arxiv.org/abs/1405.7351 (2014). 


CORRECTIONS 

The News story ‘Jelly genome mystery’ 
(Nature 509, 411; 2014) wrongly stated 
that ctenophores might be the closest living 
relatives of the first animals. It should have 
said that ctenophores may represent the 
earliest lineage to split off from the common 
ancestor of all animals. The News story 

‘US Arctic research ship ready to cast off’ 
(Nature 509, 542-543; 2014) incorrectly 
stated that a deck on the vessel able to 
accommodate large equipment is at the 
bow — itis at the stern. The News Feature 
‘Complexity on the horizon’ (Nature 509, 
552-553; 2014) inadvertently underplayed 
the role of Daniel Harlow in bringing 
computational complexity to fundamental 
physics — he worked with Patrick Hayden 
from the start of their project. 
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Researchers make headway in 
turning photons into fuel. 


BY JESSICA MARSHALL 


na bright spring morning in Pasadena, 

California, the air is rich with the smells 

of cut grass and flowers. Photosynthesis 

seems effortless here: the fronds and 
blooms that line the walkways of the Califor- 
nia Institute of Technology (Caltech) bask in 
the sunlight, quietly using its energy to store 
sugars, stretch their leaves, deepen their roots 
and tend to their cellular processes. 
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Inside Caltech’s Jorgensen Laboratory, 
however, more than 80 researchers are putting 
a lot of effort into doing the leaf’s job using 
silicon, nickel, iron and any number of other 
materials that would be more at home inside 
a cell phone than a plant cell. Their gleam- 
ing new labs are the headquarters of the Joint 
Center for Artificial Photosynthesis (JCAP), 
a 190-person research programme funded by 
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the US Department of Energy (DOE) with 
US$116 million over five years. The centre's 
goal is to use sunlight to make hydrogen and 
other fuels much more efficiently than real 
leaves ever made biomass. 

The researchers are pursuing this goal with a 
certain urgency. Roughly 13% of greenhouse- 
gas emissions worldwide come from trans- 
portation, so phasing out polluting fuels is a 
key environmental target. One approach is to 
replace cars and light trucks with electric vehi- 
cles charged by solar cells or wind — but that 
cannot tackle the whole problem. Nathan Lewis, 
an inorganic chemist at Caltech and JCAP’s 
scientific director, says that some 40% of cur- 
rent global transportation cannot be electrified. 
For example, barring a major breakthrough, 
there will never be a plug-in hybrid plane: no 
craft could hold enough batteries. Liquid fuels 
are unbeatable when it comes to convenience 
combined with compact energy storage. 

That is why funding agencies around the 
world — and at least a few private compa- 
nies — are putting unprecedented resources 
into making fuels using power from the Sun, 
which is not only carbon-free but effectively 
inexhaustible. JCAP stands out not only for its 
scale, but also for its ambition. It is one of five 
Energy Innovation Hubs created by the DOE 
beginning in 2010 to focus on specific prob- 
lems using basic research, applied research and 
engineering. JCAP has promised to deliver a 
working prototype of an artificial leaf by the 
time its initial grant runs out in 2015. 

Although the centre has taken some impor- 
tant steps in that direction — including one 
reported just last week' — itis still along way 
from delivering on that promise, “This is a 
really, really difficult, challenging problem,’ says 
electrochemist John Turner of the US National 
Renewable Energy Laboratory in Golden, Colo- 
rado. “The payback would be huge, but it’s not 
as simple as everyone wanted it to be when we 
started playing in this area 40 years ago.’ 

Still, the surge of funding and attention 
has given many researchers reason to hope 
for long-term success. “If you could sustain 
this type of effort for the next ten years,” says 
Michael Wasielewski, a chemist at Northwest- 
ern University in Evanston, Illinois, “it’s con- 
ceivable you could have a practical solution” 


CATCHING RAYS 

The concept of artificial photosynthesis goes 
back to 1912, but the push to achieve it did 
not start until 1972, when Japanese research- 
ers outlined what a device would need to take 
in sunlight and use it to split water into oxy- 
gen and hydrogen fuel’. Progress was slow. In 
1998, Turner reported’ a complete system that 
showed a major advance — it stored 12% of the 
incoming solar energy as fuel, compared with 
1% of energy stored as biomass in real leaves. 
But it cost more than 25 times too much to be 
competitive, and its performance dropped off 
after 20 hours of sunshine. 


There are three things you want from an 
artificial leaf, says Lewis: “You want it to be 
efficient, cheap and robust. I can give you any 
two today, but not the third at the same time.” 

JCAP’s mission is to fix that problem — and 
in the process, to create a system that is much 


“THE PAYBACK 
WOULD BE HUGE, 
BUTITS NOT AS 
SIMPLE AS EVERYONE 
WANTED IT TO BE.” 


cheaper than just splitting water with electricity 
froma solar panel. At the heart of JCAP’s artifi- 
cial-leaf design are two electrodes immersed in 
an aqueous solution. Typically, each electrode 
is made of a semiconductor material chosen to 
capture light energy from a particular part of 
the solar spectrum, and coated with a catalyst 
that will help to generate hydrogen or oxygen at 
useful speeds (see ‘Splitting water’). Like many 
other artificial-photosynthesis devices, JCAP’s 
system is divided by a membrane to keep the 
resulting gases apart and reduce the risk of an 
explosive reaction. 

Once the water has been split, the hydrogen 
is harvested. It can be used as a fuel by itself 
— perhaps in hydrogen-powered cars such as 
those already making their way into showrooms 
in California — or be reacted with carbon 
monoxide to make liquid-hydrocarbon fuels. 

Making any one of the artificial leaf’s com- 
ponents work well is a challenge; combining all 
of them into a complete system is even harder. 
“This is exactly like building a plane,” says 
Lewis. “You've got to not just have an engine, 
you have to have a design with wings and the 
fuselage and the engine and the avionics — and 
the plane, in the end, has to fly” 

Much of the difficulty comes down to finding 
the right materials. Silicon, for instance, makes 
a good photocathode — the electrode that pro- 
duces hydrogen gas — but is stable only when 
the solution around it is acidic. Unfortunately, 
the situation is reversed with photoanodes, 
which produce oxygen: the good ones are stable 
only when the solution is basic, not acidic. And 
the best catalyst for the oxygen-producing 
electrode, iridium, is both rare and expensive, 

which makes it unsuit- 


> NATURE.COM able for commercial-scale 
Learnabout another — devices. 

Energy Innovation JCAP’s High Through- 
Hub. put Experimentation lab 
go.nature.com/2cdwk7 —_is tackling the materials 
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problem with inkjet printers modified to churn 
out spots of alloys on glass plates for testing as 
catalysts and photoabsorbers. Together, the 
printers are able to produce up to one million 
spots of varying composition per day. 

In one experiment’ to find the best propor- 
tions of nickel, iron, cobalt and cerium oxides to 
generate oxygen from water, the team screened 
nearly 5,500 combinations for stability and 
function using a miniaturized chemical lab that 
glided over the glass plates tirelessly. The best- 
performing combination is not the most effec- 
tive catalyst ever found for this reaction, but it 
is transparent, allowing light to pass through to 
the photoabsorber, and it has good chemical 
compatibility with that material. 

One of the toughest challenges for artificial 
photosynthesis has always been getting a good 
material for the photoanode, says Carl Koval, 
an electrochemist and JCAP’s director. “Those 
things were always horribly unstable, often 
not even stable for minutes.” Many research- 
ers have focused their search on materials 
known to be cheap and stable — certain metal 
oxides, for example — and tried to make them 
into good light absorbers. Others feel that it is 
better to start with materials that are known 
to be efficient light harvesters, and to work at 
making them stable and cheap. 

Just last week, a JCAP team reported’ 
success with the latter approach. By putting a 
protective coating of titanium dioxide on high- 
performing photoabsorbers such as silicon, 
the researchers achieved big gains in stability. 
“That's basically the last piece of the puzzle 
to create the first-generation prototype,’ says 
Koval, who predicts that JCAP will have an 
artificial leaf running in the next few months. 

Publication of a preliminary system includ- 
ing the titanium dioxide coating is in the works, 
says Lewis. “That's going to be a double-digit- 
efficiency, stable system.” The threshold for 
commercial viability is thought to be in the 
10-20% range. The photoabsorbers will not 
be cheap enough to bring to market, concedes 
Lewis, because their cores are made from expen- 
sive single-crystal silicon. But if subsequent 
research shows that cheaper fabrication meth- 
ods work, the system could be cost-effective. 


SPECTRUM OF IDEAS 

JCAP will soon complete its fourth year of 
operations. It got off to a slow start as new labs 
were built, but researchers both in and outside 
the centre praise its systematic focus on pro- 
ducing a practical system, and its progress so 
far. Even Turner, whose lab bid to become the 
solar-fuels hub but lost out to Lewis's team, is 
encouraged by JCAP’s latest direction. 

Still, says Koval, the centre has its critics. 
Some take issue with its emphasis on engineer- 
ing and prototype development. But if JCAP 
were to focus on basic science, he says, it would 
not be “doing what the DOE created the hub 
for in the first place”. Other critics object to 
how JCAP concentrates on just one of several 
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SPLITTING WATER 


Artificial photosynthesis uses photons from sunlight to split water molecules into oxygen and hydrogen, which 
can be used to make fuel. Every two molecules of water yield one oxygen molecule (O,), as well as four pairs 
of protons (H*) and electrons (e-). The protons and electrons migrate across a membrane, where a 


photocathode recombines them into hydrogen using a catalyst plus sunlight. 


Catalyst-covered 
photoanode 
material 


Catalyst-covered 
photocathode 
material 


Fuel < 


possible ways to tackle artificial photosynthesis. 
‘A lot of people would have been happier if the 
DOE had spread the funding around all these 
different ways of doing this,” says Koval. But 
that kind of dilution of effort would be risky in 
its own way, he argues: “Then youd have pro- 
gress on none of them.” 

Besides, many of the alternative approaches 
are being pursued elsewhere. Up the coast in 
Santa Barbara, California, for example, a start- 
up firm called HyperSolar is testing a system 
in which coated nano- or micro-particles 
combining a photoabsorber and a catalyst 
are placed in a transparent, water-filled plas- 
tic bag. The bag will inflate as it is exposed to 
sunlight, and hydrogen and oxygen gas form 
inside. Such units could be deployed in sunny 
regions such as deserts. A 2009 DOE report” 
estimated that, if it uses cheap materials, this 
‘baggie’ approach could produce hydrogen 
economically with 10% efficiency, stable for 
10 years. 

But the system is risky, says Turner, because 
it produces oxygen along with the hydrogen. 
“If youre talking about 100 square miles of 
baggies in the desert with this explosive mix- 
ture,” he says, “one lightning bolt and you have 
a disaster.” HyperSolar researchers are explor- 
ing several ways to eliminate that danger. One 
is to use a system that separates the gases into 
two bags, says Syed Mubeen, a postdoc at the 
University of California, Santa Barbara, and 
lead scientist at the company. Another is to run 
the system using waste water rather than pure 
water, so that the oxygen reacts with organic 
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impurities and degrades them into valuable 
chemicals. This approach “completely removes 
oxygen out of the equation’, says Mubeen. As 
with JCAP’s stable photoanode, HyperSolar’s 
photoabsorber is protected by a coating. 


LIGHT INDUSTRY 
Another entrant in the artificial-photosynthe- 
sis field is the Japan Technological Research 
Association of Artificial Photosynthetic 
Chemical Process (ARPChem), a consortium 
of universities and companies that has gov- 
ernment funding comparable to JCAP’s grant 
— although over ten years rather than five — 
to develop a bag-based approach. Kazunari 
Domen, a chemist at the University of Tokyo 
and leader of ARPChem’s water-splitting 
group, says that one of the companies in the 
consortium has been working on a membrane 
to separate the hydrogen and oxygen products. 
Other projects are making photoabsorbers 
from organic molecules, rather than semicon- 
ductors. Some are building molecular assem- 
blies inspired directly by the photosynthetic 
apparatus of plants. And in the past few years, a 
class of materials called perovskites has drawn 
the attention of the solar-photovoltaic commu- 
nity for its high energy-conversion efficiency; 
some researchers think that the materials also 
have potential in artificial photosynthesis. 
Daniel Nocera, a chemist at Harvard 
University in Cambridge, Massachusetts, 
launched Sun Catalytix to develop his work 
on a low-cost catalyst. But the company 
announced last year that it has put that 
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research on hold to pursue a less challenging 
product with prospects of turning a profit for 
investors sooner. The decision underscores the 
challenges of bringing a commercially viable 
artificial-photosynthesis system to market. 


BERKELEY BUBBLES 

Ona spring day in the arty-industrial district of 
Berkeley, California, researchers demonstrate a 
prototype system inside the temporary lab space 
that houses JCAP’s northern site. As asunlamp 
shines on a CD-sized plastic box, fine streams 
of hydrogen bubbles rise between blue strips of 
catalyst-coated silicon and exit through tubes 
in the box’s top. This prototype system is not 
the team’s best: it won't last and it is not very 
efficient. But it is still encouraging to see cham- 
pagne-like bubbles triggered simply by light. 

Then Karl Walczak, a postdoc in JCAP’s 
prototyping group, slides a second plastic box in 
front of the lamp. Inside is a small black square: 
anew titanium dioxide-coated photocathode. 
This second system immediately begins to gen- 
erate bubbles much faster than the first. “This is 
where the field is going,’ says Walczak. 

JCAP researchers hope that such prototypes 
will ultimately lead to industrial hydrogen- 
production plants. They predict arrays of cells 
kilometres long, with a tower supplying water 
and pipes drawing the hydrogen to a storage 
tank. Some researchers propose that domes- 
tic units may also be part of the future, but 
Lewis warns that the small amount of sunlight 
that falls on a rooftop cannot make enough 
hydrogen to supply a family’s energy needs. 
Others say that the technology could be useful 
in areas of the developing world that lack an 
energy infrastructure, offering distributed fuel 
generation where it is needed. 

In the meantime, researchers at JCAP and 
elsewhere are moving forward on all fronts. 
Devens Gust, a chemist at Arizona State Uni- 
versity in Tempe, echoes a near-universal 
sentiment. “The bottom line,’ he says, “is that 
nobody really knows yet what’s going to win 
out, what’s going to be practical.” 

But whatever technology prevails, says 
Lewis, the logic behind artificial photosynthe- 
sis is inexorable. “The biggest energy source 
we have by far is the Sun,” he says. “The best 
way to store energy other than in the nucleus 
of an atom is in chemical fuels. It’s inevitable 
someone is going to take the biggest source and 
store it in the most dense way.” m SEE EDITORIALP.7 


Jessica Marshall is a freelance writer in 
St Paul, Minnesota. 
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For decades, most researchers 
ignored the leading genetic risk 
factor for Alzheimer’s disease. 

That is set to change. 


BY LAURA SPINNEY 


ne day in 1991, neurologist 
Warren Strittmatter asked his 

boss to look at some bewil- 

dering data. Strittmatter was 
studying amyloid-6, the main 
component of the molecular 

clumps found in the brains of people 
with Alzheimer’s disease. He was hunt- 
ing for amyloid-binding proteins in the 
fluid that buffers the brain and spinal 
cord, and had fished out one called 
apolipoprotein E (ApoE), which had 
no obvious connection with the disease. 
Strittmatter’s boss, geneticist Allen 
Roses of Duke University in Durham, 
North Carolina, immediately realized 
that his colleague had stumbled across 
something exciting. Two years earlier, 
the group had identified a genetic 
association between Alzheimer’s and a 
region of chromosome 19. Roses knew 
that the gene encoding ApoE was also 
on chromosome 19. “Tt was like a light- 
ning bolt,’ he says. “It changed my life” 
In humans, there are three common 


variants, or alleles, of the APOE gene, 
numbered 2, 3 and 4. The obvious step, 
Roses realized, was to find out whether 
individual APOE alleles influence the 
risk of developing Alzheimer’s disease. 
The variants can be distinguished from 
one another using a technique called 
the polymerase chain reaction (PCR). 
But Roses had little experience with 
PCR, so he asked the postdocs in his 
team to test samples from people with 
the disease and healthy controls. The 
postdocs refused: they were busy hunt- 
ing for genes underlying Alzheimer’s, 
and APOE seemed an unlikely can- 
didate. The feeling in the lab, recalls 
Roses, was that “the chief was off on 
one of his crazy ideas” 

Roses then talked to his wife, Ann 
Saunders, a mouse geneticist who was 
skilled at PCR. She had just given birth 
to their daughter and was on maternity 
leave, so they struck a deal. “She did 
the experiments while I held the baby,’ 
he says. Within three weeks, they had 
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collected the data that would fuel a 


series of landmark papers showing 
that the APOE4 allele is associated with 
a greatly increased risk of Alzheimer’s 
disease’. 

Twenty years on, APOF4 remains the 
leading genetic risk factor for Alzhei- 
mer’s, the most common form of demen- 
tia (see ‘Risky inheritance’). Inheriting 
one copy of APOE 4 raises a person's risk 
of developing the disease fourfold. With 
two copies, the risk increases 12-fold. 
Yet Roses’ data were largely criticized or 
ignored. Within a couple of years, inter- 
est in ApoE had dwindled as researchers 
flocked to study amyloid-f. The handful 
of labs that continued to pursue ApoE 
did so in the face of indifference from 
funding agencies and the neuroscience 
community, and without the resources 
needed to validate experimental findings 
with larger studies. 

Today, the function of the ApoE 
protein in the brain remains mostly 
unknown. This neglect of such a strong 
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Over time, 
Alzheimer’s 
disease drastically 
shrinks the brain. 


lead has puzzled some outside the Alz- 
heimer’s field. At a forum on brain dis- 
eases in Frankfurt, Germany, Thomas 
Bourgeron, an autism researcher at the 
Pasteur Institute in Paris, voiced his 
confusion. “If I had a risk factor like 


that, I'd be hot on its trail” 

But interest in the lipoprotein is 
picking up, in part because attempts to 
target amyloid-B have repeatedly disap- 
pointed in major clinical trials. Phar- 
maceutical companies are pulling back 
from amyloid-based approaches and 
some academics have begun to ques- 
tion the focus on the molecule. For the 
first time, researchers are developing 
drugs aimed at the ApoE4 protein and 
drawing attention from industry. 

“The amyloid hypothesis became 
such a strong scientific orthodoxy that 
it began to be accepted on the basis of 
faith rather than evidence,’ says Zaven 
Khachaturian, president of the non- 
profit campaign Prevent Alzheimer’s 
Disease 2020, and former coordinator 


of Alzheimer’s-related activities at the 
US National Institutes of Health. Until 
recently, he says, “no one has stepped 
back to ask the fundamental question 
of whether our basic premise about the 
disease is the correct one”. 


STIFF COMPETITION 

Opinions differ as to why Roses’ find- 
ing was neglected, but many agree 
that bad timing played a part. In 1991, 
John Hardy and David Allsop had pro- 
posed the ‘amyloid cascade hypothesis: 
This posits that Alzheimer’s disease 
results from the abnormal build-up of 
amyloid-f clusters, or plaques, in the 
brain’. Others rallied around the idea 
and it has won most of the funding 
available to the field ever since. 

But Roses did not subscribe to that 
theory. “Amyloid is one of many sub- 
stances that builds up in plaques as a 
result of dying cells and atrophy in the 
brain,” he says. “I never did think it was 
the cause.” In saying so, he may have 
deterred others from investigating a pos- 
sible ApoE-amyloid link, and inadvert- 
ently set up a competition between the 
two hypotheses for funding. He never 
got another grant to work on ApoE. 

But there were also technical obsta- 
cles to ApoE research. The protein 
is found throughout the body, mak- 
ing it difficult to target the molecule 
specifically in the brain. And ApoE is 
bound to fat, so it tends to stick to other 
molecules in biochemical assays, says 
Menelas Pangalos, who leads research 
on small-molecule discovery at Astra- 
Zeneca in Macclesfield, UK, and has 
long had an interest in ApoE. 

Working with such proteins requires 
an intimate understanding of lipid bio- 
chemistry. “If you want to study ApoE 
biology, you really need to devote a 
laboratory to understanding the tech- 
niques,” says neurologist David Holtz- 
man of Washington University in St. 
Louis, Missouri. Holtzman did just that, 
establishing a separate lab dedicated to 
developing techniques for handling lipo- 
proteins in the central nervous system. 

Amyloid was the easier target. Two 
decades of intensive pursuit have 
yielded a range of drugs that alter the 
metabolism of amyloid-f, but these 
have yet to fulfil expectations. Of the 
six drugs that were in phase II or III 
clinical trials in 2012, half have since 
been dropped because of either safety 
concerns or lack of effectiveness. This 
comes against a backdrop of ageing 
populations, overstretched health-care 
systems and a dearth of medications for 
Alzheimer’s disease. “The large number 
of major failed trials in Alzheimer’s is 
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quite frightening,” says Lennart Mucke, 
director of the Gladstone Institute of 
Neurological Disease at the University of 
California, San Francisco. “It has really 
scared off big pharma.” 

The three remaining drug candidates 
that target amyloid-f are currently 
being tested in people with Alzheimer’s, 
as well as in individuals who have a high 
risk of developing the disease but who 
have not yet developed symptoms. 
Imaging studies have shown that the 
brains of high-risk individuals look 
and behave differently from controls 
decades before the onset of Alzheimer’s, 
and long before they start to accumulate 
amyloid-f or lose grey matter’. The tri- 
als will examine whether the drugs pre- 
vent or delay the onset of the disease; 
they are due to wrap up over the next 
six years. There is a growing sense in the 
field — among academics and indus- 
try representatives alike — that these 
efforts are the last chance for the amy- 
loid hypothesis. Amid these concerns, 
the spotlight has swung back to ApoE. 

If the prevention trials fail, it will be 
up to academics to persuade companies 
back to the table with solid preclinical 
and early clinical data, says Mucke. He 
is optimistic that ApoE researchers 
may soon have that leverage. Despite 
the obstacles in this area, there is an 
emerging understanding of how ApoE4 
increases risk, which Holtzman’s and 
Mucke’s groups have explored through 
transgenic mice that they have devel- 
oped to express human forms of ApoE. 

The molecule seems to contribute to 
Alzheimer’s through two distinct path- 
ways, one of which is amyloid-depend- 
ent. In both animals and humans, ApoE4 
strongly promotes amyloid-f deposition 
in the brain, compared with ApoE3, long 
considered the ‘neutral form when 
it comes to Alzheimer’s risk. ApoE2, 
which is considered the protective form, 
decreases the build-up*. “These are 
compelling data,” says Holtzman. 

The other mechanism does not 
involve amyloid. When neurons are 
under stress, they make ApoE as part of 
a repair mechanism. The ‘bad’ ApoE4 
form tends to be broken down into 
toxic fragments that damage the cell's 
energy factories — the mitochondria 
— and alter the cell skeleton. 

The relative contribution of these 
two pathways to Alzheimer’s risk is 
not known, says Holtzman, but he and 
others think that changing a harmful 
form of ApoE into a less damaging one 
might prove a promising therapeutic 
approach. At the Gladstone, cardiovas- 
cular scientist Robert Mahley, working 
with a team including neuroscientist 
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Yadong Huang, has identified small 
‘corrector’ molecules that modify the 
structure of ApoE4 protein to one more 
like that of ApoE3, thereby reducing 
abnormal fragmentation’. 

In cell culture, low concentrations of 
these corrector molecules can reduce 
mitochondrial impairment and neu- 
ronal dysfunction’. They are now being 
tested more rigorously in a range of ani- 
mal models. If the molecules ultimately 
prove safe and effective in humans, 
Mucke foresees a day when doctors will 
prescribe them for people deemed at risk 
of Alzheimer’s, just as statins are offered 
to those with high cholesterol and an 
elevated risk of cardiovascular disease. 


ABOVE AND BEYOND 

Such drugs could also have implications 
beyond Alzheimer’s. “The mitochon- 
drial-impairment hypothesis provides 
a pretty logical and parsimonious expla- 
nation for why ApoE4 does bad things,” 
says Mucke, “not only in the context of 
Alzheimer’s, but maybe also in other 
diseases.” There is evidence that it may 
be a risk factor in Parkinson's disease 
and epilepsy. It is also associated with an 
increased risk of a poor outcome after 
brain injury, and more rapid progression 
of untreated HIV infection. Fifteen bio- 
technology companies are already col- 
laborating with the Gladstone to develop 
these and similar drugs. 

Despite his inability to get grants, 
Roses never gave up on ApoE. But a 
few years after his group discovered the 
link between ApoE and Alzheimer’s, 
he wearied of the constant battles for 
funding. He left academia and spent ten 
years in industry — where he continued 
to work on ApoE, among other things 
— before returning to Duke in 2008. 

In 2009, his group described a stretch 
of non-coding DNA in a gene called 
TOMM240 that sits next to APOE on 
chromosome 19. This stretch of DNA, 
known by the shorthand 523, varies in 
length. The length of 523 can deter- 
mine the extent to which TOMM40 and 
APOE are expressed’. 

The discovery was important, Roses 
says, because the protein encoded by 
TOMMA4O0, called Tom40, is crucial to 
healthy mitochondria. Tom40 forms 
a channel in the outer mitochondrial 
membrane that is used to import pro- 
teins. Without these proteins, mito- 
chondria cannot divide as they should 
throughout a cell’s life. “It’s a big effect 
that’s been known about for a decade,” 
says Roses, “But it’s not well-known in 
the Alzheimer’s field” 

Roses went on to suggest that 523 
could be exploited to develop therapies 


and improved tests for Alzheimer’s risk. 
Most people will develop Alzheimer’s if 
they live long enough, but only about 
25% of people carry an APOEF4 allele. 
As a result, a prognostic test for APOE4 
will only ever be partially informa- 
tive. But genotyping both APOE and 
TOMM40 could provide information 
about a wider swathe of the population, 
Roses says. His group has found, for 
example, that APOE3 — by far the most 
common APOE allele in humans — is 
usually inherited either with a short ora 
very long 523 tract. In those who inherit 
two APOE3 alleles, age of onset differs 
depending on which combination of the 
two 523 variants they also inherit. 

Other labs have found evidence sup- 
porting Roses’ hypotheses, but some 
attempts to replicate his TOMM40 find- 
ings have failed. In 2012, Hardy, now at 
University College London, and a col- 
league, geneticist Rita Guerreiro, wrote 
an editorial’ in which they argued that 
TOMM240 did not independently affect 
Alzheimer’s risk. 

Roses faith in his hypothesis has not 
wavered: he believes he has a sound 
mechanistic explanation for his find- 
ings. And he says that the genome- 
wide studies that failed to reproduce his 
results lacked sufficient power to reveal 
the association between TOMM40 and 
Alzheimer’s disease. Khachaturian says 
a proper test of Roses’ findings — using 
Roses’ methods in a larger cohort of 
patients — has not yet been done. 

Roses hopes to soon be able to back 
up his findings with more clinical data 
and has launched a company called 
Zinfandel Pharmaceuticals in Durham, 
fuelled in part by his own funds. Along 
with the Japanese pharmaceutical com- 
pany Takeda, based in Osaka, Zinfan- 
del is currently funding a phase III trial, 
called TOMMORROW, that will put 
his ideas to the test. TOMMORROW 
is expected to run for about 5 years 
and will recruit close to 6,000 healthy, 
elderly individuals. It will evaluate a 
risk-assessment algorithm based on 
age, APOE and TOMM40. 

The trial will also investigate whether 
a low dose of a drug called pioglita- 
zone — already approved at much 
higher doses for certain patients with 
type 2 diabetes — can delay disease 
onset in those individuals deemed by 
the algorithm to be at high risk of Alz- 
heimer’s. Evidence from animal and 
small-scale human studies suggests that 
pioglitazone may prevent or reverse 
Alzheimer’s-related pathology and 
symptoms’. Roses thinks it may do so 
by stimulating mitochondria to divide. 

The ongoing trials could have major 
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RISKY INHERITANCE 


People who carry the gene variant APOE4 tend to develop 
Alzheimer’s at a younger age than those with two copies of APOE3. 
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“No one has stepped back to ask 
whether our basic premise about 
the disease is the correct one.” 


consequences even without yielding 
a cure: research has shown that an 
intervention that could delay the onset 
of Alzheimer’s by just 2 years would 
result, 50 years later, in nearly 2 million 
fewer cases of the disease in the United 
States than projected otherwise’’. And 
the results coming in over the next few 
years could force researchers to re-eval- 
uate their understanding of dementia. 
It is time it was recognized for what it 
is, says Khachaturian: a failure of com- 
plex, interacting physiological systems. 
Looking at any one of these systems — 
even those involving ApoE4 — in isola- 
tion is unlikely to fully explain changes 
in behaviour. “The field is going to 
recognize the limitations of current 
approaches and step back,” he says. 
“And if were honest with ourselves, 
we'll start forging new directions.” = 


Laura Spinney is a freelance writer in 
Lausanne, Switzerland. 
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IMAGINECHINA/CORBIS 


In Lanzhou, China, 700 mountains are being levelled to create more than 250 square kilometres of flat land. 


Accelerate research 
on land creation 


China’s campaign to bulldoze mountains to build cities needs expertise to 
avert geoengineering problems, warn Peiyue Li, Hui Qian and Jianhua Wu. 


( ‘esis cities are expanding rapidly 
as its economy grows’. Yet land for 
development is in short supply, par- 

ticularly in mountainous areas, where about 

one-fifth of the population lives’. In the past 
decade, local governments have begun 
removing the tops of mountains to fill in 
valleys to create land on which to build. 
This is reminiscent of an ancient Chinese 
fable, ‘The Foolish Old Man who Removed 
the Mountains’ In the tale, a 90-year-old 


man convinces his disbelieving neighbour 
that he can dig away, stone by stone, two 
mountains that block the way from his house. 
Because he succeeds (albeit with the help of 
deities) the fable is often cited — including 
by Mao Zedong — to illustrate the power of 
perseverance. But in our view, China should 
heed the story’s title: earth-moving on this 
scale without scientific support is folly. 

In cities such as Chongqing, Shiyan, 
Yichang, Lanzhou and Yanan, tens of square 


© 2014 Macmillan Publishers Limited. All rights reserved 


kilometres of land have been created’ (see 
‘Moving mountains’). One of the largest pro- 
jects, which started in April 2012 in Yan’an 
in the Shaanxi province, will double the city’s 
current area by creating 78.5 square kilome- 
tres of flat ground*. Local officials are opti- 
mistic that the sale or leasing of the new land 
will generate billions of renminbi, while eas- 
ing pressures on agriculturally or culturally 
valuable land elsewhere. 
But the consequences of these 
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> unprecedented programmes have not 
been thought through — environmentally, 
technically or economically. There has been 
too little modelling of the costs and benefits 
of land creation. Inexperience and techni- 
cal problems delay projects and add costs, 
and the environmental impacts are not being 
thoroughly considered. Yaran, for example, 
is the largest project ever attempted on loess, 
thick million-year-old deposits of wind- 
blown silt. Such soft soils can subside when 
wet, causing structural collapse’. 

Overcoming these problems will require 
the sharing of experiences and technical 
strategies nationally and internationally. 
Collaboration between universities, research 
institutes, private companies, construc- 
tion teams and governments is needed to 
ensure that the best research informs the 
engineering, so that the vast sums of public 
money involved are invested wisely. Without 
cooperation, the land-creation campaigns 
could drain local and national economies 
and irrevocably damage watercourses and 
ecosystems. 


TOO LITTLE, TOO LATE 

Land creation by cutting off hilltops and 
moving massive quantities of dirt is like 
performing major surgery on Earth’s crust”. 
Mountaintop moving has been done before 
in strip mining, especially in the eastern 
United States”. But it has never been carried 
out on this scale. In China, dozens of hills 
100-150 metres in height are being flattened 
over hundreds of kilometres. Such infill has 
never been used for urban construction. 
There are no guidelines for creating land in 
the complex geological and hydrogeological 
conditions that are typical of mountainous 
zones. 

Land-creation projects are already caus- 
ing air and water pollution, soil erosion and 
geological hazards 
such as subsidence. “Government 
They destroy forests funds are 


and farmlands and_ not allocated 
endanger wild ani- according 
mals and plants. The fo scientific 
city of Shiyan, for significance 
example, lies near grneed.” 


the headwaters of the 

South-North Water Transfer Project — a 
major initiative to divert water from rivers 
through canals in southern China to Beijing 
and the north. There, the changing of hills 
to plains has caused landslides and flooding, 
and altered watercourses. This causes greater 
soil erosion, increasing the sediment content 
of local water sources (see go.nature.com/ 
klasxh; in Chinese). 

In Yanan, the air is often brown with dust 
owing to construction teams working on 
windy days without dampening the soil. 
Forests and plants on hills and in gullies are 
stripped ahead of the demolition and filling. 
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MOVING MOUNTAINS 


Tens of square kilometres of land have been created by levelling 
hills in several cities in mainland China. Work began in 2012 to 
make nearly 80 square kilometres of flat ground in Yan’an. 


| assessment still pending. 


ca 


This increases the risk of soil erosion and 
groundwater loss, because farmlands and 
forests block wind and retain moisture and 
soil grains. During the earth-moving pro- 
jectin the city of Lanzhou’, soil erosion is 
expected to increase by 10% and concentra- 
tions of dust particles in the air by 49%. 

Many land-creation projects in China 
ignore environmental regulations, because 
local governments tend to prioritize mak- 
ing money over protecting nature. When 
air pollution associated with the Lanzhou 
project became visible in April 2013, work 
was halted pending an environmental 
assessment. Four weeks later, construction 
resumed because of mounting costs to the 
local government and contractors. The 
assessment has yet to be completed. 

The assumption that land creation always 
brings economic benefits needs to be chal- 
lenged through cost modelling. For example, 
the 100-billion-renminbi (US$16-billion) 
total cost over ten years of the Yan’an plan 
could take decades to recoup. The infill area 
cannot be built on for at least another dec- 
ade, when its ground base becomes stable. 
This long payback period will dissuade some 
investors from buying the land, thus increas- 
ing economic risk. 


INADEQUATE INSIGHT 

Research into land creation is insufficient. A 
lack of expertise, collaboration and funding 
means that scientists have not been able to 
conquer technical problems or provide tested, 
efficient or timely support for the engineer- 
ing projects. Basic research into the hydraulic 
parameters of soils and aquifers was stalled by 
a year-long delay in distributing government 
research funding for 2013. 
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| Air pollution halted construction; 
| work resumed despite environmental 
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Soft soil could subside when 
wet, causing structural collapse. 


Research is local and narrow. In Yan’an, 
the Chinese government has granted 
30 million renminbi to several local insti- 
tutes and universities to solve some tech- 
nical problems. The research focuses on 
the monitoring and modelling of ground 
deformation, subsidence, hydrogeological 
changes and protection of the surrounding 
slopes. Economics and environmental risks 
are not included. 

The number and breadth of research 
teams is inadequate, and the limited experi- 
ence and skills of local scientists are a prob- 
lem. Researchers in Yanan, for example, have 
expertise in hydrogeology and engineering 
geology but are less experienced in soil- 
water research, which is important for the 
stability of the ground base. Institutes have 
struggled to collaborate. Each manages its 
own sub-project and seldom shares research 
data with others. Teams must repeat work 
to obtain data that another institute already 
has, wasting time and money. 

No international organizations or scien- 
tists who have rich expertise and practical 
experience are participating. Those who 
have participated in mountaintop removal 
in the United States should be invited to take 
part in Chinese projects. 

Research funding comes exclusively 
from the government. Industries, institutes 
and universities carry out the research, 
yet few governmental departments are 
involved, limiting their influence. In 
Yan’an, only the Shaanxi Science and 
Technology Department and the Chinese 
Ministry of Science and Technology have 
participated or provided funding. Other 
local government departments involved 
in areas such as housing or environmental 


DIGITALGLOBE 


protection are not involved. 

Government funds are not allocated 
according to scientific significance or need. 
Rather, they are directed on discretion of 
the local institute in charge of the project. 
For instance, water-soil interaction is the 
key factor affecting the success of the Yan'an 
project but the drive to understand it attracts 
only 12% of the research budget. And envi- 
ronmental and ecology-related research 
receives no funding in Yaran, despite being 
very much in the public interest. 

Research also lags behind the engineering 
work, and so has little impact. In Yar“an, the 
research started three months after excava- 
tions began. Lab tests that could have estab- 
lished the exact moisture content needed to 
harden loess foundations were unavailable 
to guide the project. Preliminary results are 
now available, but for only a few soils. An 
earlier start could have delivered more. 


BOOST RESEARCH 

We call on the Chinese government to accel- 
erate research efforts urgently. With good 
guidance, land creation could proceed with 
fewer risks and more benefits. 

China’s best researchers from Beijing, 
Shanghai and Wuhan should join regional 
collaborations. And a national network, 
similar to the National Basic Research Pro- 
grams of China, should be established to 
coordinate this effort, to ensure quality and 
avoid duplication. 

Scientists from the US Geological Survey, 
the US Environmental Protection Agency, 
the International Association of Hydro- 
geologists and universities in Canada, the 
United States and Europe should also be 
brought in. Local construction industries 


* 


Satellite images of western Shiyan, China, in 2010 (left) and 2012 (right) after several peaks have been flattened. 


could be involved and provide funds and 
equipment. 

Local and national governments should 
allocate funds according to scientific impor- 
tance, guided by of a pool of national experts 
including hydrogeologists, environmental- 
ists, economists and government officials. 
Extra funding must be found for environ- 
mental and ecological risk assessment and 
for economic studies of land creation. An 
independent management organization 
should be established and supervised by 
local government to manage and share the 
research data. 

Environmentalists, ecologists, hydro- 
geologists, engineers and decision-makers 
must collaborate to find solutions to 

environmental prob- 


“Economists lems — including the 
needtoassess disappearances of 
the costs and small creeks, endan- 
benefits of gering of wild ani- 


mals and birds and 
altered groundwater 
flows. 

As a minimum the following recom- 
mendations should be heeded: during earth 
movement, soils should be watered to raise 
the moisture content to avoid dust pollu- 
tion; ground cover and trees on hills and 
in gullies that are destroyed before infilling 
should be replanted as soon as possible to 
prevent soil erosion; and farmers who lose 
farmland should be compensated. 

Economists need to assess the costs and 
benefits of land-creation projects and sug- 
gest ways to reduce the risks before these 
projects begin. Where high economic risks 
and low profitability are predicted, pro- 
jects should not proceed even if they are 


land-creation 
projects.” 


© 2014 Macmillan Publishers Limited. All rights reserved 


technically feasible, at least for now. Where 
there is no profit, governments should be 
dissuaded from going ahead. 

Consulting environmental scientists 
and economists in advance will prevent or 
minimize extra costs arising from delays or 
troubleshooting. Pausing the Yan’an excava- 
tions would cost 500,000 renminbi a day in 
payments to idle construction workers, for 
instance’. 

As in the ancient fable, China is moving 
mountains. Until we know more about the 
consequences, we urge governments to seek 
scientific advice and proceed with caution. = 


Peiyue Li is assistant professor of 
hydrogeology and environmental science 

in the School of Environmental Science and 
Engineering at Changan University, China. 
Hui Qian and Jianhua Wu are in the School 
of Environmental Science and Engineering 
at Changan University, China. 
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BOOKS & ARTS 


William Astbury used X-ray crystallography to unravel the changeable structures of proteins. 


| MOLECULAR BIOLOGY | BIOLOGY 


Of DNA and 
broken dreams 


Georgina Ferry weighs up a life of William Astbury, who 
had a forgotten role in pinning down the double helix. 


istory is written by the victors, in 
H science as in war. The story of the 

race to uncover the structure of 
DNA was thrust into the public imagina- 
tion by James Watson's highly partial 1968 
account, The Double Helix. Ever since, his- 
torians have been filling in the gaps. Most 
notably, successive authors have corrected 
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Watson's unkind portrait of Rosalind Frank- 
lin, whose X-ray-diffraction photograph of 
DNA (‘Photo 51’) was crucial to the model 
developed by Watson with Francis Crick. 
Now science historian Kersten Hall has 
disinterred another figure from the footnotes 
of the DNA story, and brought him into the 
limelight. William Astbury — known to 
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all as Bill — was part of the group of X-ray 
crystallographers who learned their craft at 
the feet of William Henry Bragg, the father 
of the field, at London’s Royal Institution in 
the 1920s. Astbury, along with fellow stu- 
dent J. D. Bernal, was fired with the new 
technique’s possibilities for studying the key 
molecules of life, especially the proteins that 
are fundamental to all living tissue. 

Bernal left the Royal Institution to found a 
lab at the University of Cambridge, UK; Ast- 
bury was banished (as he saw it at the time) 
to the University of Leeds in northern Eng- 
land. There he applied X-ray analysis to the 
study of natural fibres such as wool and hair: 
Leeds was an important centre of the textile 
industry. The wool protein keratin, Astbury 
complained on his appointment, was “lifeless 
and uninteresting”. Later, he would tell public 
audiences that wool had given him “a glimpse 
of the loom on which the web oflife is woven”. 

In the early 1930s, Astbury obtained 
X-ray-diffraction photographs of stretched 
and unstretched wool fibres, which suggested 
that the protein chain could switch between 
a compact and an extended form. Apart 
from shedding light on the elasticity of wool, 
his discovery showed that the properties of 
long-chain proteins depended on their three- 
dimensional shape, and that changes in shape 
could have functional consequences. 

Hall makes a good case for Astbury’s con- 
tribution to protein-structure analysis, but the 
book’s subtitle signals that the author is really 
interested in DNA. In 1938, Astbury and his 
PhD student Florence Bell — a protégée of 
Bragg’s son Lawrence, and now largely forgot- 
ten — published the first X-ray photographs 
of DNA fibres. They were blurred and hard to 
analyse, because the Leeds team did not know 
that two forms of DNA were mixed in their 
sample. But they contributed to Watson and 
Crick’s thinking, and encouraged Maurice 
Wilkins — and subsequently Franklin — at 
King’s College London to undertake X-ray 
studies of the molecule. 

In 1951, Astbury’s “lab boy” turned doc- 
toral student, Elwyn Beighton, produced 
X-ray photos of DNA 
that were almost 
identical to Franklin’s 
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bury never published | @ } 
Beighton’s pictures 
or presented them in 
public, nor did he seem 
to recognize that the 


distinctive X-shaped 


The Man in the 


diffraction pattern Monkeynut Coat: 
showed that DNA was de i ihe eal 
° aa is clearly frus- poate: 

Double-Helix 
trated that Astbury = xersten THALL 
missed the chance to Oxford University 
wina Nobel Prize and Press: 2014. 
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change the face of biology. He dwells at length 
on what might account for this failure. Ast- 
bury had become depressed and embittered 
as he saw others in London and Cambridge 
win funding that he had failed to secure. He 
had conceded the DNA problem to Wilkins 
at King’s. He may not have kept up with the 
theory of X-ray diffraction. Finally, Hall sug- 
gests, Astbury probably saw DNAs apparently 
fixed structure as uninteresting compared to 
the conformational changes in proteins; at 
the time, his main interest (and the subject of 
Beighton’s thesis) was bacterial flagella. 

Hall indulges in some counterfactual 
speculation. What if Astbury had shown 
Beighton’s pictures to the US chemist — and 


later Nobel laureate 


“Astbury’s — Linus Pauling, 
student produced who stayed with 


X-rayphotosof Astbury in Leeds 
DNA that were in 1952? Pauling 
almost identical might have seen 


to Rosalind their significance 
Franklin’s, taken and realized Wat- 
the following son and Crick’s 
year. 3 worst fears by 

beating them to the 


structure. But it didn’t happen, and Watson's 
1968 book credits Astbury only with the “one 
half-good photograph” taken by Bell in 1938. 

Hall tells his story with style and pace. But 
Iam unconvinced by his central contention, 
that Astbury was a “titan” and the founder of 
molecular biology. Astbury was one of many 
who independently promoted a molecular 
approach to understanding living things. 
Success in science depends as much on per- 
sonality as on intellect, and here Hall leaves 
us largely in the dark. Clearly Astbury was 
a larger-than-life figure, a good communi- 
cator and entertaining in company (if you 
liked “off-colour jokes”), yet not favoured by 
the scientific establishment. Crystallogra- 
phy pioneer Dorothy Hodgkin (who taught 
chemistry to Astbury’s daughter Maureen at 
the University of Oxford, UK — a connection 
not mentioned by Hall) once described him 
affectionately as “very bad and very amusing”. 

And the monkeynut coat? Astbury 
worked with the now defunct UK company 
Imperial Chemical Industries to develop 
fibres made from a protein derived from 
peanuts, and proudly wore a coat made from 
‘Ardil’ It proved to be no cheaper and con- 
siderably less hard-wearing than wool, and 
never took off. Hall uses this story to launch 
a final chapter exploring Astbury’s prescient 
reflections on the manipulation of biological 
materials for utilitarian ends. He did not live 
to see the age of biotechnology, but he was 
certainly one of its prophets. m 


Georgina Ferry is a science writer based in 
Oxford, UK, and author of biographies of 
Dorothy Hodgkin and Max Perutz. 

e-mail: mgf@georginaferry.com 


Books in brief 


Do Fathers Matter?: What Science Is Telling Us About the Parent 
We’ve Overlooked 
Paul Raeburn FARRAR, STRAUS & GIROUX (2014) 
When science journalist Paul Raeburn first assessed the available 
research on fathers, he found the pickings decidedly slim. Scientists 
are now redressing the balance, and Raeburn has rounded up key 
findings in developmental psychology, evolutionary biology, genetics 
| and neuroscience. Prepare for a bracing walk through the myriad 
| ways fathers matter, from the “genomic battle of the sexes” that can 
lead to differing syndromes in offspring, to a father’s ‘destabilizing’ 
—and hence stimulating and educative — play with his children. 


The Island of Knowledge: The Limits of Science and the Search for 
Meaning 

Marcelo Gleiser BASIC Books (2014) 

Among the bright knowns of science are countless unknowns, 
untouched by the most exquisitely calibrated, powerful instruments 
yet invented. Physicist Marcelo Gleiser sets out to explore the extent 
to which we can make sense of reality. He maps humanity’s “island 
of knowledge” from pre-Socratic cosmology to quantum mechanics. 
And he marks how mysteries proliferate with each finding — a 
perpetual blur in our cosmic view that, like mathematician Kurt 
Gédel’s work on incompleteness, spurs scientific creativity. 


a The Consolations of Economics: How We Will All Benefit from the 


New World Order 
iT y Gerard Lyons FABER AND FABER (2014) 

HE The economic conflagration of 2008 inspired plenty of bravura ideas 
CONSOLATIQ on how to fireproof the future. Economist Gerard Lyons, who saw it 
WF ECONOMIes all coming, offers a gloom-free reading of causes and correctives, 
Mae A gy and predicts global growth led by multiple economies. Drawing on 
GERARD Yous key case studies and long experience with China, he examines world 
i) ff j economic drivers, ‘soft’ and ‘hard’ power (a country’s influence 


_ exerted through cultural appeal, or by military or economic means), 
and the economic goals and strategies that foster global stability. 


Gotham Unbound: The Ecological History of Greater New York 
Ted Steinberg SIMON & SCHUSTER (2014) 

How did Mannahatta, a wild expanse of hills and mudflats on North 
America’s Atlantic coast, become the high-density, concrete-coated 
city of New York? Ted Steinberg’s environmental history traces the 
stages from Henry Hudson’s 1609 discovery of a “drowned estuary” 
to the ravages of Hurricane Sandy in 2012. Centuries of extreme 
land-use changes — reclaiming underwater terrain, levelling hills, 
draining lakes and crafting ersatz terra firma from landfill — have 
created an ecologically scarred city vulnerable to further storms. 
Other coastal megacities should take heed. 


Human Evolution: A Pelican Introduction 

Robin Dunbar PELICAN Books (2014) 

The British Academy’s ‘Lucy to Language’ project has spawned 
major sociobiological findings, most recently aired in Thinking Big 

by Robin Dunbar, Clive Gamble and John Gowlett (see Nature 509, 
284-285; 2014). Dunbar draws on that research — in particular 

the social-brain hypothesis and time-budget models — for this solid 
primer on human evolution under the relaunched Pelican imprint. 
On the journey from the australopithecines to the Neolithic and what 
followed, Dunbar is an accomplished guide. Barbara Kiser 


5 JUNE 2014 | VOL 510 | NATURE | 33 


©) 2014 Macmillan Publishers Limited. All rights reserved 


| COMMENT | BOOKS & ARTS 


Standards for children’s weight began to be established at the turn of the twentieth century. 


The medicalization of fat 


A history of the shifts in US society’s view of the larger 


child grips David Katz. 


etween the moment I began to read 
Be= Dawes’s Childhood Obesity in 
America and my completion of this 
review, two high-profile studies came out on 
US childhood obesity. The first suggested a 
slight decline in obesity rates among 2-5 year 
olds, spawning headlines claiming that over- 
all childhood-obesity rates had plummeted 
(C. L. Ogden et al. J. Am. Med. Assoc. 311, 
806-814; 2014). The second reported that 
severe obesity in children aged 2-19 was ris- 
ing (A. C. Skinner and J. A. Skelton J. Am. 
Med. Assoc. Pediatr. http://doi.org/svw; 2014). 
Such wild gyrations say something about 
our emotionally charged relationship with 
this topic. However intense the preoccupa- 
tion with childhood obesity, it is not an exclu- 
sively modern phenomenon. What differ 
are the medical, cultural and moral lenses 
through which we view it, and its salience in 
modern epidemiology. Dawes acknowledges 
that much of the prevailing attention to child- 
hood obesity is “a lament of modernity’, and 
she has 100 years of carefully collated evi- 
dence to prove it. Through anecdotes and an 
overview of social trends, she provides the 
historical context for our fixation, revealing 
shifting cultural perceptions, medical pre- 
occupations, scientific advances and eco- 
nomic forces linked to the phenomenon. By 
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clarifying where we 
have been, Dawes aims 
to guide us forward. 
Adiposity has at 
times been admired. In 
Renaissance Europe, it 
was synonymous with 
wealth; in the Baroque 


period it was an attrib- 

ute of feminine beauty. Childhood Obesity 

And it was valued inAmerica: 

in the United States Biography of an 

around the start of the aipeeier = 

twentieth century, as ee 

Dawes shows. Chats i liaicea ad 
Press: 2014. 


Bilcher, for instance, 

was a Pennsylvanian boy whose bulk was 
a source of pride to his parents — and to 
the nation when, in 1896, he won the title of 
“fattest boy in the world”. 

As Dawes shows, when the extra-large 
child was rare, she or (mostly) he was likely 
to be acclaimed. With nearly one-fifth of US 
children and adolescents now obese, the tide 
of opinion has turned. A tendency to appreci- 
ate bulk may have origins in the evolutionary 
struggle to ensure that children had enough 
to eat. Around the time of Bilcher’s win, the 
girth of US-born versus immigrant children 
was a national preoccupation. “Better Babies” 
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contests became popular, fuelled in part by 
the “more abundant comforts” proffered by 
US industry. By showing how recently such 
a stance prevailed, Dawes shines a light on 
an important source of cultural ambivalence 
about the ‘war’ on obesity. Maybe we are yet to 
win because we are fighting with half heart. 
Dawes shows how much time and effort 
in the early twentieth century was devoted 
to generating measures of obesity that we 
take for granted today. A focus on length and 
weight standards was established with physi- 
cian Luther Emmet Holt’s 1894 The Care and 
Feeding of Children, and measures such as the 
Ponderal index — precursors to the body 
mass index — emerged. The more reliably 
obesity could be measured, the more it mat- 
tered: measuring informs norms; norms are 
used for comparison; and comparisons illu- 
minate outliers. Dawes traces the emergence 
of paediatrics as a medical speciality in the 
1870s, advances in epidemiology such as the 
capture of population-level data to establish 
reference standards for height and weight, 
and a growing respect for objective measures. 
To me, the most fascinating portion is 
the chronological sequence of ideas about 
the causes and cures of childhood obesity, 
because they reveal how often we have gone 
back to the future. In Diet and Health (1918), 
the first best-selling US diet book, physician 
Lulu Hunt Peters aired the idea of the calorie 
as a factor in obesity. The glandular theory of 
obesity was a prominent paradigm into the 
1940s: a 1937 case series showed that eight 
obese children had been successfully treated 
with pituitary-gland extracts and thyroid 
tablets. Today, debate still simmers about 
the importance of the endocrine system in 
obesity, with some arguing that hormonal 
imbalance explains the condition so well that 
calories cease to matter. Dawes’s chronology 
also shows how there has been a focus on one 
cause or remedy to the exclusion of others. 
When ‘glands’ were to blame, psychology was 
largely ignored. When psychoanalysts held 
sway, endocrinologists were pushed aside. In 
Dawes’ final chapters, she hints at a potential 
solution through a synthesis of such ideas. 
Dawes is neither a clinician nor a public- 
health practitioner; she is a historian of medi- 
cine. But perhaps health professionals’ efforts 
to contain, control, prevent and reverse child- 
hood obesity have been constrained by a fail- 
ure to see and consider the full scope of the 
threat and the best defences — that is, by the 
focus on one idea at a time. Dawes replaces 
such parochial perspectives with a window 
100 years wide. May it help show us the way. = 


David Katz is the founding director of Yale 
University Prevention Research Center in 
Derby, Connecticut, and editor-in-chief 
of Childhood Obesity. His latest book is 
Disease Proof. 

e-mail: david.katz@yale.edu 
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Design animal 
studies better 


Steve Perrin argues that animal 
models of human disease should 
be better characterized to avoid 
false-positive results (Nature 507, 
423-425; 2014). Poor design, 
execution and reporting of studies 
are pervasive contributors as well. 

For example, randomization 
and blinding in animal studies 
is pitifully infrequent (see, for 
instance, J. P. Ioannidis et al. 
Lancet 383, 166-175; 2014). 
Randomly allocating animals to 
experimental and control groups 
makes the experimental groups 
as similar as possible in other 
respects. Blinding promotes 
comparable handling and 
measurement by experimenters, 
and publicly preregistering 
animal-study protocols and 
the outcomes to be measured 
would identify and reduce bias in 
reporting results. 

Creating largely homogeneous 
experiments aids reproducibility 
and boosts statistical power, but 
has a cost of generalizability: the 
few drugs that have translated 
successfully from animals are 
effective across a broad range of 
circumstances (see, for example, 
E. S. Sena et al. J. Cereb. Blood 
Flow Metab. 30, 1905-1913; 
2010). 

Funders and ethics committees 
need to ensure that study 
designs include these features, 
and journals should make them 
criteria for publication if they 
have not done so already. 
Malcolm R. Macleod* University 
of Edinburgh, UK. 
malcolm.macleod@ed.ac.uk 
*On behalf of 6 co-authors (see 
go.nature.com/js4uzj for full list). 


Investment suggests 
RNAi is on the up 


If pharmaceutical investment 

is a measure of the prospects 

for therapeutics that use RNA- 
interference (RNAi) technology 
for gene silencing (Nature 508, 
443; 2014), on balance, things are 
looking up. 


Yes, Novartis shuttered 
its RNAi effort and Merck 
sold its RNAi programme to 
Alnylam Pharmaceuticals. 
These were the right decisions: 
Novartis and Merck are both 
accomplished companies, but 
their decade-long efforts in 
RNAi failed to yield any clinical 
programmes. 

What you did not mention 
was Genzyme's investment in 
Alnylam in early 2014 — one of 
the largest and most significant 
strategic transactions in recent 
biotech history. Genzyme, a 
Sanofi company, purchased a 
12% stake in our company in 
return for broad product rights 
in the field of rare diseases in 
regions outside North America 
and western Europe. 

Companies such as Genzyme 
do well to partner with 
technology innovators such 
as Alnylam for access to their 
products, not to the technology 
platform. This is how drug 
companies ultimately had 
success with recombinant DNA 
and monoclonal antibodies. The 
same story could be emerging 
with RNAi. 

John Maraganore Alnylam 
Pharmaceuticals, Cambridge, 
Massachusetts, USA. 
jmaraganore@alnylam.com 


Mine centuries-old 
citizen science 


Historical data on biodiversity 
would be valuable for 
investigating the long-term 
impact of human activities. 
Contrary to popular belief, such 
data have been widely collected 
for several hundred years 
through initiatives that would 
today be described as ‘citizen 
science. 

For example, in the late 
sixteenth century, the Spanish 
government distributed 
questionnaires known as 
relaciones topogrdaficas to each 
village, with local inhabitants 
providing a compilation of 
natural-history knowledge. The 
637 questionnaires that survive 


include information on some 
190 species of wild animals and 
plants, gleaned from more than 
4,300 individual records. 

Also, the 16-volume 
geographical dictionary edited 
by the statistician and politician 
Pascual Madoz in the mid- 
nineteenth century contains 
information on most Spanish 
population centres, rivers 
and geographical landmarks. 
Madoz’s enterprise involved 
the collaboration of more than 
1,000 citizens and provided 
several thousand records of 
wild plants and animals, as yet 
unused. 

Similar historical data sets also 
exist from China, most European 
countries and their former 
colonies. These can be biased 
towards socio-economically 
important species, but this 
bias can help to minimize 
misidentifications, for instance. 

We call for an international, 
multidisciplinary effort 
(including historians, linguistic 
experts, geographers and 
biologists) to locate, compile and 
contextualize these invaluable 
historical data for incorporation 
into global biodiversity 
databases. 

Miguel Clavero, Eloy Revilla 
Donana Biological Station — 
CSIC, Seville, Spain. 
miguelclavero@ebd.csic.es 


Moon mapped by an 
artist’s impression 


Thomas Harriot mapped 

the Moon before Galileo 

(J. Rampling Nature 508, 39-40; 
2014). Unlike Galileo, he did not 
interpret what he saw as craters, 

mountains and valleys (see 

S. Pumfrey Notes Rec. R. Soc. 63, 
163-168; 2009). 

Galileo’s experience in the 
visual arts gave him a theoretical 
background in mathematical 
perspective (see, for example, 

S. Y. Edgerton The Heritage 

of Giottos Geometry 233-239, 
Cornell Univ. Press, 1991; and 
S. E. Booth and A. Van Helden 
Sci. Context 13, 463-486; 2000). 


This included using shadows 
to determine the volume and 
location of an object. Observing 
the Moons surface with his 
perspicillum, later named 
telescope, Galileo's training as a 
draftsman led him to interpret 
variations in light on the surface 
as shadows produced by the Sun 
on a complex topography. 
Harriot did not grasp the 
significance of these dark areas. 
He interpreted them simply as 
surface markings that he used to 
‘map’ the Moon. Art and science 
have sometimes benefited each 
other in ways that we fail to 
appreciate. 
José Rafael Martinez Enriquez 
National Autonomous University 
of Mexico, Mexico City, Mexico. 
enriquez@unam.mx 


Experimentation 
needs theory, too 


John Skoyles emphasizes 

the importance of practical 
experimental work for the 
developing scientist (Nature 
508, 319; 2014). But theory is 
crucial too — for interpreting 
the results and for advancing 
research. 

In 1928, British physicist Paul 
Dirac came up with his equation, 
which predicted the existence 
of the positron. The particle 
was discovered independently 
four years later by US physicist 
Carl Anderson at the California 
Institute of Technology in 
Pasadena. This is a good example 
of “theory guides; experiment 
decides” (the motto of analytical 
chemist Izaak Maurits Kolthoff). 

Oras the astrophysicist Arthur 
Stanley Eddington declared: 

“Tt is also a good rule not to put 
overmuch confidence in the 
observational results ... until 
they are confirmed by theory.” 

Perhaps the last word should 
come from Soviet physicist Peter 
Leonidovich Kapitza. He said: 
“Theory is a good thing, but a 
good experiment is forever.” 
Min-Liang Wong National 
Chung-Hsing University, Taiwan. 
mlwong@dragon.nchu.edu.tw 
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OBITUARY 


Gerald Guralnik 


(1936-2014) 


Physicist who helped to conceive the Higgs boson. 


erald Guralnik is best known for 
(S" work on the Higgs mechanism, 

which confers mass on particles. 
Along with particle physicist Carl Hagen 
and myself, he wrote one of three key 
papers published in 1964 that each took 
different routes to solving a conundrum of 
how particles interact. 

Guralnik died ofa heart attack on 26 April, 
aged 77, shortly after finishing a lecture at 
Brown University in Providence, Rhode 
Island, where he had been a faculty mem- 
ber since 1967. He was born in Cedar Falls, 
Iowa, where his parents ran an accountancy 
firm. Guralnik attended the Massachusetts 
Institute of Technology (MIT) in Cam- 
bridge. He then went on to Harvard Univer- 
sity, also in Cambridge, to study for a PhD 
under Walter Gilbert, who was working on 
the theory of elementary particles before 
embarking on his Nobel-prizewinning 
experiments for DNA sequencing. 

In 1964, Guralnik completed his doctor- 
ate in quantum field theory, which models 
subatomic particles as physical fields that 
extend throughout space. That same year, 
he won a US National Science Foundation 
fellowship to a theoretical physics research 
group, founded by Gilbert’s former adviser 
Abdus Salam, at Imperial College London. 

In the years after the Second World War, 
the success of quantum electrodynamics, 
which describes the electromagnetic force 
that drives interactions between matter and 
light, had spurred work on similar theories 
of other interactions. A key goal was a uni- 
fied theory of the electromagnetic force and 
the weak force, which is observed in radio- 
active B-decay and the Sun's core. The big 
problem was that to explain the short range 
of the weak forces, the bosons transmitting 
them had to have large masses. By contrast, 
the electromagnetic force is carried over 
vast distances by the massless photon. 

When I met Guralnik in 1964 at Impe- 
rial, I was interested to learn that he, too, 
was concerned with the problem of boson 
mass. Hagen, his friend and former MIT 
classmate, also came to Imperial, and the 
three of us worked together to find a solu- 
tion. We were constant lunch companions. 
Guralnik enjoyed the work but often com- 
plained about the food — especially Scotch 
eggs (hard-boiled eggs baked into a grainy 
coating of sausage meat and breadcrumbs). 

Just as our paper was in its final draft, 
we discovered two earlier papers (one by 
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Frangois Englert and Robert Brout at the 
Université Libre de Bruxelles in Belgium, 
and the other by Peter Higgs at the Uni- 
versity of Edinburgh, UK) that addressed 
the problem of giving mass to bosons from 


different perspectives. All three papers 
showed that the mechanism could success- 
fully use spontaneous symmetry breaking, 
in which the underlying physics is symmet- 
ric but its realization is not. For example, 
a bowl of water is rotationally symmetric, 
until it freezes and the ice crystals break the 
symmetry. A particular contribution of our 
paper was to show how spontaneous sym- 
metry breaking could occur consistently 
within the formal structure of the theory. 
The three papers reached essentially the 
same conclusions, and were later selected 
by Physical Review Letters (the journal in 
which they were published) as among the 
most significant of 1964. 

The publications attracted few citations 
in their first three years. Guralnik toured 
Europe and gave talks to sceptical audi- 
ences. Werner Heisenberg, known for 
formulating the eponymous uncertainty 
principle, told him that he was talking 
nonsense. When Guralnik won a postdoc 
position at the University of Rochester in 
New York, the head of the group urged him 
to switch to a different field (later admitting 
that the advice was misguided). 

Our proposal, now generally known as the 
Higgs mechanism, is the cornerstone of the 
well-established unified electroweak theory, 
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which explains both the electromagnetic 
and weak interactions. Steven Weinberg 
and Salam married this mechanism with a 
preliminary unified model proposed ear- 
lier by Sheldon Glashow, thus resolving its 
inconsistencies. For this work, the three of 
them shared the 1979 Nobel Prize in Physics. 

The validity of the Higgs mechanism 
was finally demonstrated in 2012, with the 
experimental discovery of the Higgs boson, 
which last year led to another Nobel Prize, 
for Englert and Higgs, the surviving authors 
of the first two papers putting forth the 
mechanism. 

The 1979 prizewinners carefully cited 
all three of the 1964 papers on an equal 
footing, but others have not always done 
so. Guralnik came to feel that our early 
paper was often unfairly neglected. He 
gave talks and wrote papers pointing out 
our distinctive contribution, of which he 
was justifiably proud, and in which he was 
unquestionably the prime mover. 

In the meantime, Guralnik had gone on 
to contribute much to the development of 
computational approaches to quantum field 
theory. From 1985 to 1987, as a staff mem- 
ber at Los Alamos National Laboratory in 
New Mexico, he used such approaches to 
make important contributions to quantum 
chromodynamics, the theory of strong 
interactions, which explains that quarks 
interact through a ‘colour force’. Later, 
he worked on how chaos theory could 
be applied to particle physics. He also 
made significant contributions to string 
theory, some in collaboration with his son, 
Zachary Guralnik. 

Gerry was always good company and 
had a great sense of humour. As well as 
being passionate about physics, he was 
enthusiastic about many things: fast cars, 
photography and state-of-the-art comput- 
ers. Gerry was a member of Brown's Ersatz 
Brain Project, which aims to design a brain- 
like computer. He could be combative when 
others failed to live up to his exacting stand- 
ards, but he was warm and generous. 

Gerry once wrote that being able to work 
as a theoretical physicist was a privilege; his 
colleagues felt the privilege was theirs. m 


Tom Kibble is emeritus professor of 
theoretical physics at Imperial College 
London. He worked there with Gerald 
Guralnik in the 1960s. 

e-mail: kibble@ic.ac.uk 
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b 
Remaining Placozoa Porifera Ctenophora Choanozoa 
animals (Trichoplax) (sponges) (comb jellies) (unicellular 
flagellates) 


Figure 1 | Dual origins of nerves. a, The comb jelly Pleurobrachia bachei. b, An evolutionary tree depicting the genetic relationships between the early animal 
lineages. Moroz et al.' suggest that nerve cells (indicated by the neuron symbol) have evolved independently in the lineage that includes humans and the one that 
leads to comb jellies. Nerve cells are absent in all other animal groups. 


EVOLUTIONARY BIOLOGY 


Excitation over jelly nerves 


Analysis of the draft genome of a comb jelly and of gene-transcription profiles from ten other ctenophores hints at an 
independent evolutionary origin for the nervous systems of these organisms. SEE ARTICLE P.109 


ANDREAS HEJNOL 


r | the ability of animals to respond rapidly 
and appropriately to changes in their 
environment is due to the presence of 

a nervous system consisting of up to billions 

of nerve cells. In this issue, Moroz et al.' 

(page 109) present the genome of the comb 

jelly Pleurobrachia bachei, otherwise known 

as the sea gooseberry (Fig. la). Following a 

detailed examination of the developmental 

genes, structural genes and signalling mol- 
ecules that are necessary for the set-up and 
function of nervous systems in other animals, 
the authors come to a radical conclusion: that 
the nervous system of comb jellies might have 
evolved independently of that of other animals. 

Only two groups of animals do without a 
nervous system: sponges, which are simple 
animals attached to the sea bottom that do 
not show complex behaviours, and the placo- 
zoans, animals comprised of two flat sheets of 
cells that creep along the ocean floor absorb- 
ing nutrients. The simplicity of sponges and 
placozoans has led generations of zoologists to 
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conclude that they are ancient animal groups, 
and may look very like the first multicellular 
animals that emerged on the planet more than 
500 million years ago. 

Over the past decade, however, extensive 
comparisons of protein and DNA sequences 
have led to surprising rearrangements at the 
base of the animal tree of life. In fact, it seems 
that previous assumptions about the origin of 
multicellular animals may be wrong, and that a 
group of gelatinous creatures, the ctenophores, 
collectively referred to as comb jellies, could 
be the first group to have branched off from 
the animal tree of life’. Further evidence for 
this reappraisal came from a whole-genome 
analysis of another comb jelly, Mnemiopsis 
leidyi, which indicated that ctenophores 
lack many molecular characteristics that are 
essential for all other animals’. For example, 
they do not have microRNAs, important con- 
trollers of gene expression in other animals’, 
or Hox genes, which encode transcription 
factors that are essential for patterning the 
body axis’. 

So what are these mysterious animals that 


© 2014 Macmillan Publishers Limited. All rights reserved 


have turned the table on textbook ideas? Comb 
jellies are fabulous marine predators that pro- 
pel themselves through the water column 
by means of blocks of cilia — the shimmer- 
ing combs that give them their name. They 
catch their prey using innervated tentacles 
seamed with sticky cells called colloblasts and 
swallow it through their mouth, which opens 
into a sac-like gut. They have a nerve net with 
regional specializations, such as a sensory 
organ located at one pole of the body that is 
used for light reception and gravity sensing’. 
Notably, sponges and placozoans lack all of 
these features, so the proposal that the jellies 
evolved first seems odd. 

However, Moroz et al. agree with the idea 
of ctenophores being one of the two first 
animal lineages (Fig. 1b). Their analysis, 
which alongside the Pleurobrachia genome 
includes gene-transcription profiles from ten 
other ctenophores, finds that many building 
blocks essential for nervous-system develop- 
ment in other animals are also missing from 
comb jellies. Furthermore, some molecules 
that are neuron-specific in other animals are 
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present in the sea-gooseberry genome but, 
surprisingly, are expressed not in nerve cells but 
in other tissues. And of the 10 major common 
neurotransmitter molecules used by animals, 
Pleurobrachia employs only two (glutamate 
and its antagonist GABA) in the operation 
of its nervous system. On the basis of these 
characteristics, it would seem that an animal 
with such a small number of traditionally neu- 
ral proteins would have a simple nerve net, 
as opposed to a central nervous system, and 
would not show any complex behaviour. On 
the contrary, however, comb jellies perform 
complex actions such as predation and hori- 
zontal diurnal migrations in the water column, 
so they must use different molecules in their 
nervous system. 

The phylogenetic position of comb jellies 
at the base of the animal tree of life and the 
findings of Moroz and co-workers suggest a 
fascinating scenario — that comb jellies 
evolved a nervous system that is unrelated 
to that of other animals. Heretical hypoth- 
eses such as this strike a blow against the 
anthropocentric view that complex animals 
emerged gradually along one lineage only, 
culminating in humans, and that complex 
organ systems did not evolve twice. But such 
views do not reflect how evolution really 
works. Evolution does not follow a chain of 
events in which one lineage progresses con- 
tinuously towards complexity while other 
branches stagnate’. Instead, it is an ongoing 
process in all lineages. When the animal tree 
branched more than 500 million years ago, 
one lineage gave rise to ctenophores and the 
other to all remaining animals alive today, and 
it seems that the two lineages independently 
evolved a rapid internal communication 
system. 

However, the last word has not yet been said 
on this issue, because the branching sequence 
of the earliest animal groups is still hotly 
debated. Some researchers have expressed 
doubt that ctenophores are at the base, and 
claim that the lack of many genes in comb jel- 
lies can be explained by massive gene loss that 
mimics a simple genome’. 

Regardless of where ctenophores finally 
end up on the tree, the development and 
evolution of the complex nervous system of 
these creatures will be an enigma for some 
time. If it turns out that comb jellies are not at 
the base of the tree and that animal neurons 
indeed originated only once, someone must 
figure out why the molecular biology under- 
lying the comb-jelly nervous system is so 
different from that of other animals. m 
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How Antarctic 


ice retreats 


New records of iceberg-rafted debris from the Scotia Sea reveal episodic retreat 
of the Antarctic Ice Sheet since the peak of the last glacial period, in step with 
changes in climate and global sea level. SEE LETTER P.134 


TREVOR WILLIAMS 


bout 19,000 years ago, the ice sheets that 
Aw large areas of North America 

and Eurasia began to melt. The ice on 
Antarctica also melted, but to a lesser extent 
— most of it still exists today. Understand- 
ing past ice-sheet instability and melting is 
important for predicting future ice behaviour 
in a warming world. On page 134 of this issue, 
Weber et al.' present new well-dated records 
of iceberg-rafted debris from two marine sedi- 
ment cores from the Scotia Sea that reveal at 
least eight episodes of ice loss from Antarctica 
between 20,000 and 9,000 years ago. These 
records allow examination of interactions 
between temperature, ice melt and the water 
masses of the Southern Ocean, which are 


central to the carbon cycle and to climate 
change between glacial and interglacial periods. 

Since the Last Glacial Maximum (LGM) — 
the peak of the most recent glacial period 
— which occurred between about 26,000 
and 19,000 years ago’, melting ice sheets have 
raised global sea level by about 130 metres 
(ref. 3). But there are significant uncertainties 
in the timing and amount of ice lost from Ant- 
arctica. For example, estimates of sea-level rise 
resulting from Antarctic ice melt have ranged 
from about 8m to 30m, with the most recent 
estimates around the lower end of this range’. 

Currently, Antarctica loses ice by two main 
processes: melting of the underside of floating 
ice shelves and calving of icebergs. The ice- 
bergs melt slowly as they are carried westwards 
along the coast of Antarctica, and icebergs 


Figure 1 | Iceberg-rafted debris from marine sediment offshore of Porpoise Bay, East Antarctica. 
Weber et al.' describe similar iceberg-rafted debris from two marine sediment cores from the southern 
Scotia Sea that document several episodes of ice loss from Antarctica between 20,000 and 9,000 years ago. 
Scale bar, 2mm. 
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that reach the Weddell Sea are blocked by the 
Antarctic Peninsula and turn northwards, 
where they melt more rapidly in the warmer 
waters of the Scotia Sea. Icebergs themselves 
are ephemeral, but they carry mineral grains 
and rock fragments scoured from Antarctic 
bedrock. As icebergs melt, this iceberg-rafted 
debris (IBRD; Fig. 1) falls to the seabed and is 
steadily buried in marine sediments to form a 
record of iceberg activity. 

Some icebergs are more debris-rich than 
others: large tabular icebergs calved from 
floating ice shelves have already lost much of 
their debris-rich bases to melting, whereas 
icebergs that calve close to the line between 
grounded and floating ice tend to retain their 
debris, and are probably more common during 
ice-sheet retreat than during times of ice-sheet 
stability. Nevertheless, the main interpretive 
link is sound — more IBRD is a sign of more 
icebergs and greater ice loss from the Antarctic 
Ice Sheet. The two IBRD records reported by 
Weber and colleagues are similar, even though 
the sediment-core sites are separated by 2° of 
latitude and are subject to different local 
oceanographic conditions. This similarity 
increases confidence that the IBRD records 
represent an iceberg signal. 

Until now, the timing of ice retreat has been 
constrained by radiocarbon dating of marine 
sediments and by dating of land surfaces that 
were uncovered as the ice sheets thinned”®. 
Here, the eight episodes of iceberg discharge 
were dated by matching the record of wind- 
blown dust in the same sediment cores to wind- 
blown dust in an already-dated Antarctic ice 
core. This approach results in a continuously 
dated record, which provides a significant 
advance in knowledge of when the Antarctic 
Ice Sheet retreated over the time since the LGM. 
Weber et al. find that the first of the Antarctic 
ice discharges took place 20,000-19,000 years 
ago and was followed by a series of larger epi- 
sodes between 17,000 to 9,000 years ago. 

The largest iceberg release lasted from 
14,800 to 14,400 years ago and overlapped, 
within dating uncertainty, with a period of sea- 
level rise known as meltwater pulse LA (MWP- 
1A), which occurred 14,650 to 14,310 years 
ago’. During this period, sea levels rose by 
about 14-18 m at the astonishing rate of 4m 
or more per century. Weber and colleagues’ 
iceberg-discharge data clearly show a contri- 
bution to M™WP-1A from Antarctica, but how 
much meltwater does this represent? Recent 
work" puts the total budget for sea-level rise 
from Antarctic ice melt since the LGM at about 
9m, and this melt budget has to be shared 
among the eight iceberg-discharge events, 
including MWP-1A. By this reckoning, the 
Antarctic contribution to MWP- 1A is rela- 
tively minor compared with the contribution 
from Northern Hemisphere ice, which must 
provide the balance of the roughly 14-18-m 
rise in sea level. This result contrasts with 
modelling of differences in amplitude of the 
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MWP-1A sea-level rise at different locations 
(caused by the gravitational and rotational 
effects of removing the ice mass), which esti- 
mates that half or more of MWP-1A comes 
from Antarctica’. This mismatch has yet to be 
fully resolved. 

Questions remain about which areas of 
the Antarctic Ice Sheet became unstable and 
produced these iceberg-discharge events. Did 
major ice-drainage sectors retreat simulta- 
neously, perhaps in response to external 
triggering such as an initial sea-level rise from 
the north or southward migration of relatively 
warm Circumpolar Deep Water, as modelled 
by the authors? Or did different sectors retreat 
independently, as individual thresholds for 
instability were crossed in each sector? Prin- 
cipal sources of icebergs were probably the 
nearby Antarctic Peninsula and Weddell Sea 
embayment, where ice streams drain about a 
quarter of Antarctic ice by area. Icebergs are 
also likely to have travelled from other ice out- 
lets around East Antarctica. The provenance 
of the IBRD, and the icebergs that carried it, 
can be found by matching the geochemical 
fingerprint (such as characteristic argon- 
isotope ages) of individual mineral grains in 
the IBRD to the corresponding geochemical 
fingerprint of the different source areas* — a 
topic for future study. 

The episodic iceberg discharges described 
by Weber and colleagues shed light on the 
question of how the Antarctic ice sheets melt. 
Will there be similar iceberg releases in the 
future? Ice streams in the Amundsen Sea sec- 
tor of the West Antarctic Ice Sheet are already 
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in the early stages of retreat”’®. During the last 


interglacial period, about 125,000 years ago, 
sea levels reached 6-9 m higher than today’"”, 
much of this attributable to an Antarctic melt- 
water source, at global temperatures only 
1-2°C warmer than those of today. The planet 
is on course for a temperature rise exceeding 
this value, so we can expect similar ice-sheet 
instability and retreat to that described by 
Weber et al. in the future. = 


Trevor Williams is at the Lamont-Doherty 
Earth Observatory, Columbia University, 
Palisades, New York 10964, USA. 

e-mail: trevor@ldeo.columbia.edu 


1. Weber, M. E. et al. Nature 510, 134-138 (2014). 

2. Clark, P. U. et al. Science 325, 710-714 (2009). 

3. Austermann, J., Mitrovica, J. X., Latychev, K. & 
Milne, G. A. Nature Geosci. 6, 553-557 (2013). 

4. Whitehouse, P. L., Bentley, M. J. & Le Brocq, A. M. 
Quat. Sci. Rev. 32, 1-24 (2012). 

5. Heroy, D.C. & Anderson, J. B. Quat. Sci. Rev. 26, 
3286-3297 (2007). 

6. Hillenbrand, C.-D. et al. Quat. Sci. Rev. http://dx.doi. 
org/10.1016/j.quascirev.2013.07.020 (2013). 

7. Deschamps, P. et a/. Nature 483, 559-564 (2012). 

8. Pierce, E. L. et al. Paleoceanography 26, PA4217 
(2011). 

9. Joughin, |., Smith, B. E. & Medley, B. Science http:// 
dx.doi.org/10.1126/science.1249055 (2014). 

10.Rignot, E., Mouginot, J., Morlighem, M., Seroussi, H. 
& Scheuchl, B. Geophys. Res. Lett. http://dx.doi. 
org/10.1002/2014GL060140 (2014). 

11.Kopp, R. E., Simons, F. J., Mitrovica, J. X., Maloof, 
A.C. & Oppenheimer, M. Nature 462, 863-867 
(2009). 

12.Dutton, A. & Lambeck, K. Science 337, 216-219 
(2012). 


This article was published online on 28 May 2014. 


Short RNAs and 
shortness of breath 


The simultaneous deletion of six RNA molecules in mice has been found to cause 
respiratory and fertility defects, owing to improper assembly of structures called 


cilia on the cell surface. SEE ARTICLE P.115 


IRMA SANCHEZ & BRIAN D. DYNLACHT 


efects in the assembly of cilia — fine 
D projections of the cell surface — are 

the cause of a plethora of human dis- 
eases’. One such disease is primary ciliary 
dyskinesia, a syndrome that severely com- 
promises the function of the respiratory tract 
and reproductive system, resulting in airway 
infections and infertility. Identification of fac- 
tors that control the assembly of cilia would 
constitute a major scientific advance. In this 
issue, Song et al.’ (page 115) implicate clusters 
of non-protein-coding RNA molecules in the 
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regulation of cilium assembly, and show that 
deletion of these regulators in mice leads to 
a syndrome reminiscent of primary ciliary 
dyskinesia. 

There are two types of cilium, non-motile 
and motile, both of which are essential for 
normal growth and development. Single, non- 
motile primary cilia are present on diverse cell 
types and function as sensors that provide 
the cell with information about the external 
environment. Motile cilia, meanwhile, func- 
tion to coordinate fluid flow, and are found in 
large numbers on multiciliated cells (MCCs) 
in the respiratory tract and the Fallopian tubes 


of the female reproductive tract. Assembly of 
both types (a process known as ciliogenesis) 
is initiated by a cellular organelle called the 
basal body. A basal-body precursor, located 
inside another organelle, the centrosome, 
matures and migrates to the plasma mem- 
brane, extending microtubular structures into 
the extracellular space to form the axoneme — 
the skeleton of the cilium (Fig. 1). But how this 
process is controlled is unclear. 

A group of small, non-protein-coding 
RNAs called microRNAs (which regulate 
protein production by inhibiting the transla- 
tion of messenger RNA) seem to be new-found 
regulators of motile ciliogenesis. The genetic 
region, or locus, known as miR-449 encodes 
three such miRNAs, and has been linked** 
to the development of a type of ciliated cell 
in the bronchi that is responsible for clearing 
mucus. A study” of the role of miR-449 in two 
types of MCC — the mucociliary epidermis 
cells of embryonic frogs and the cells that line 
the human bronchi — showed that expression 
from this locus increases as cellular precur- 
sors differentiate into MCCs. The miR-449 
locus shares sequence similarity with two 
other miRNA loci, miR-34a and miR-34b/34c. 
Together, these three loci encode six miRNAs 
in vertebrates. 

The miR-34/449 miRNAs are remarkably 
abundant in tissues enriched with motile 
cilia, comprising around 50% of the miRNAs 
expressed in the embryonic epidermis of frogs 
and 13% of the miRNAs in the cells that line 
the human respiratory tract*. To test whether 
the miR-34/449 cluster is essential for the ini- 
tiation of ciliogenesis, Song et al. deleted all 
three loci in mice by mating existing strains 
harbouring single deletions. Simultaneous 
deletion was necessary because the six miRNA 
molecules have similar functions, and so can 
compensate for one another if only one locus 
is deleted. Triple-mutant mice failed to thrive 
— only 40% reached adulthood, and those that 
did were half the size of controls. The mice 
exhibited severe respiratory distress owing to 
a failure to clear mucus, and were sterile, mir- 
roring defects observed in people with primary 
ciliary dyskinesia. 

Song and colleagues obtained tracheal tissue 
from the mutant mice and examined it using 
high-speed imaging. They observed abnor- 
mal ciliation in mutant tracheal MCCs, which 
resulted in reduced fluid flow. This was not due 
to a developmental defect in the MCC precur- 
sors, but rather to a substantial reduction in the 
length and number of cilia per cell. 

Multi-ciliation is achieved through an 
ordered process, beginning with the multipli- 
cation of basal bodies that subsequently dock 
at the cell surface. The authors found no differ- 
ences in basal-body replication in mutant and 
control cells. Instead, high-resolution struc- 
tural studies revealed that many basal bod- 
ies remained undocked in mutant cells, and 
so failed to assemble cilia. Those that did 
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Figure 1 | Ciliogenesis. The production of multiciliated cells from precursors occurs through an 
orderly process known as ciliogenesis, which begins with a single precursor of an organelle called the 
basal body, located inside another organelle, the centrosome. As the cell differentiates, the basal-body 
precursor multiplies and matures. Basal bodies then migrate and dock to the surface of the cell, extending 
microtubular structures into the extracellular environment to form cilia. Song et al? report that, as 
precursors undergo differentiation, six miRNA molecules expressed by three related genetic regions 
regulate normal ciliogenesis in mice and frogs. The miRNAs inhibit translation of the messenger RNA 
that encodes the centrosomal protein CP110. The authors find that enhanced expression of CP110, 
brought about by simultaneous deletion of the six miRNAs, prevents normal docking of basal bodies to 


the cell surface (not shown). 


dock generated significantly shorter, albeit 
structurally normal, axonemes. 

Together, these data suggest that the 
miR-34/449 cluster might regulate basal- 
body docking. Song and co-workers screened 
a panel of genes for potential miR-34/449 tar- 
gets, looking for genes that not only contained 
potential miR-34/449 binding sites but also 
showed downregulated expression during 
MCC differentiation (indicative of inhibition 
by highly expressed miRNAs). The authors 
identified more than 50 candidate genes from 
the screen, and chose the centrosomal protein 
CP110 asa focus for further investigation. 

CP110 is an attractive target of miR-34/449 
because previous work® has shown that over- 
expression of CP110 suppresses primary cili- 
ary assembly, and that destruction of CP110 
at basal bodies is required to trigger early 
stages of ciliogenesis. MicroRNAs are known 
to have multiple targets’, and the miR-34/449 
cluster may regulate other transcripts that 
also influence ciliogenesis — much like miR- 
129-3p, which targets both cp110 in ciliated 
cells and components of the cell’s structural 
skeleton in other cell types*. However, when 
Song et al. inhibited cp110 in frog embryos 
lacking miR-34/449 (which have increased 
levels of CP110), they found a substantial 
improvement in basal-body docking and fewer 
cilia defects, suggesting that excessive CP110 
caused most of the observed defects. The cur- 
rent study, together with the previous report on 
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miR-129-3p (ref. 8), unifies a rapidly growing 
body of evidence establishing an evolutionarily 
conserved regulatory role for CP110 in cells 
with primary and motile cilia. 

It is worth noting that proliferating and 
differentiating cells dedicate a remarkable level 
of attention to the control of CP110 abun- 
dance, which is regulated by transcriptional’, 
post-transcriptional*’ and post-translational 
mechanisms’. Indeed, no fewer than seven 
miRNAs (miR-129 and the miR-34/449 clus- 
ter)*** and a pair of enzymes'*"' have been 
identified as being involved in the robust regu- 
lation of cp110 levels. It is likely that the find- 
ings of Song and colleagues represent the tip 
of the iceberg, and it will be exciting to explore 
the roles that other miRNAs have in modula- 
ting the levels of ciliary proteins that require 
similar fine tuning. 

More generally, it will be useful to under- 
stand how subtle changes in the regulation 
of proteins such as CP110 could give rise to 
human disease. Intriguingly, links between 
aberrant CP110 levels and respiratory ill- 
ness have already been uncovered in a pre- 
vious study’. That study and the work of 
Song and colleagues indicate the impor- 
tance of exploring the role of genes such 
as cp110, which regulate basal body and 
centrosomal components, in diseases asso- 
ciated with defects in ciliogenesis, includ- 
ing primary ciliary dyskinesia and human 
respiratory disease. m 
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The MAD world 
of black holes 


An analysis of optical and radio observations has revealed how powerful jets are 
launched from the centres of active galaxies, where supermassive black holes 
accrete matter through magnetically arrested disks, or MADs. SEE LETTER P.126 


DENISE GABUZDA 


Ithough the light given off by most 
As is due to stars and glowing 

gas, some galaxies have extremely 
bright centres, or nuclei, with luminosities 
about 100,000 times greater than those of 
normal galaxies. In such active galactic nuclei, 
energy is liberated when matter spirals 
inwards and is captured by a supermassive 
black hole — billions of times more massive 
than the Sun — sitting at the galactic centre. 
In about 10% of these active nuclei, some of 
the in-spiralling matter is pushed into two 
jets of matter and radiation that shoot out in 
opposite directions at close to the speed of light 
(Fig. 1). However, the forces causing these out- 
flows have remained unknown. In this issue, 
Zamaninasab et al.' (page 126) report direct 
evidence that the jets are launched on their 


journey by a kind of gigantic electromagnetic 
generator, in which magnetic fields in the 
vicinity of the black hole are twisted by the 
black hole’s spin, with the energy of this spin 
being transformed into the energy of the jets’ 
outward motion’. 

It has long been suspected that this mecha- 
nism might explain how jets in active galactic 
nuclei (AGN) are produced. However, direct 
observational evidence has been elusive, 
because the scales on which this mechanism 
should operate are tiny compared with the 
smallest scales that can be observed directly 
using the highest-resolution technique 
available, which are typically about 1 par- 
sec (3 x 10’° metres) from the central black 
hole. This scale might seem huge, but at the 
extremely large distances of AGN (billions of 
parsecs), it translates to a tiny angular distance 
as observed on the sky. 
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Images of such small scales can be obtained 
using very-long-baseline interferometry 
(VLBI), an elegant technique in which radio 
telescopes around the world observe in syn- 
chrony to imitate a single radio telescope with 
a diameter the size of Earth. The larger the 
telescope used, the finer the detail that can be 
seen; accordingly, VLBI yields radio images 
with phenomenally high resolution, equivalent 
to being able to peer across the Atlantic Ocean 
from the western edge of Europe and identify a 
coin held by someone standing on the eastern 
coast of the United States. 

Unfortunately, however, the scales imaged 
with VLBI are still tens to thousands of times 
larger than those on which the powerful jets 
of AGN are launched. Zamaninasab et al. 
have found a clever way to bridge this gap, 
by considering the magnetic flux in the jets 
— essentially, the product of the magnetic 
field pointing along the jet and the jet’s cross- 
section. The magnetic flux near the black hole 
cannot be measured directly, but should be 
proportional to the luminosity of the matter in 
the accretion disk surrounding the black hole. 
The disk’s luminosity can be estimated from 
observations of optical lines in the spectrum 
of the AGN. 

The authors have found a tight linear cor- 
relation between the estimated accretion-disk 
luminosities for 76 AGN and the magnetic 
fluxes in their jets measured with VLBI. This 
correlation demonstrates that the magnetic 
flux near the black hole is proportional to the 
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Figure 1 | Jets emerging from the centre of an active galactic nucleus. a, Composite image of the optical (true colour), X-ray (blue) and radio (orange) 
emission of the giant elliptical galaxy NGC 5128, showing its powerful radio jets extending far beyond the optical galaxy. b, Zamaninasab et al.' have found 
evidence that jets emerging from the centres of active galaxies are launched from deep in the galactic nucleus (square in a) by twisted magnetic fields’, and that 
these fields are dynamically important in the vicinity of the galaxy’s central black hole and surrounding accretion disk. 
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A: ESO/WFI (OPTICAL); MPIFR/ESO/APEX/A. WEISS ET AL. (SUBMILLIMETRE); 


NASA/CXC/CFA/R. KRAFT ET AL. (X-RAY) 


magnetic flux far down the jets, as would be 
expected if the electromagnetic jet-launching 
mechanism referred to above is at work. The 
authors’ results thus provide direct observa- 
tional evidence that this is the case. 

Theoretical simulations™ of accretion disks 
have shown that, under certain conditions, the 
magnetic flux in the vicinity of the black hole 
naturally reaches a maximum equilibrium 
value. When this happens, forces exerted by 
the magnetic field dominate in the inner part 
of the disk. Disks for which this is true are 
called magnetically arrested disks, or MADs™. 
Zamaninasab et al. find that the derived slope 
of the linear correlation between the accretion- 
disk luminosity and the jet magnetic flux is 
precisely the value predicted for such disks, 
strongly suggesting that this MAD scenario is 
operating in the hearts of AGN. 

These results indicate that the jets of AGN 
are launched electromagnetically by magnetic 
fields twisted by the black hole’s spin, that these 
magnetic fields have a dominant role in deter- 
mining the dynamics of the disk and jets in the 
vicinity of the central black hole, and that this 
may remain true at least out to VLBI scales, 
several parsecs from the black hole. It will 
therefore be important to consider the influ- 
ence of the magnetic field, for example, when 
inferring the properties of the central black 
hole and accretion disk from high-resolution 
studies made with millimetre-wavelength, 
ground-based VLBI*”’, or with ‘space VLBI, 
in which one or more antennas orbiting Earth 
are used with ground antennas*”. 

Zamaninasab and colleagues’ findings also 
radically change the way astronomers view 
the jets emanating from the centres of AGN. 
These jets are not just outflows of matter car- 
rying tremendous amounts of energy, but are 
also intrinsically magnetic structures. Many 
of their properties are probably determined 
by the magnetic fields embedded in them and 
travelling outwards with them. The twisting 
of the central magnetic fields that launches 
the jets should give rise to helical jet mag- 
netic fields, which may be manifest in the 
jets’ magnetic-field structure and morphol- 
ogy". Because a fundamental relationship 
exists between magnetic fields and electrical 
currents, jet outflows should be regarded 
as systems of magnetic fields and currents. 
This is essential if we are to understand these 
enormous structures: how they propagate, 
why they remain so narrow as they traverse 
enormous distances, and how they inter- 
act with the material through which they 
are moving. 

As the jets travel beyond their host galaxy 
and into intergalactic space, effects other than 
magnetic forces will probably also come into 
play, making the jets and the surrounding gas 
more turbulent and reducing the magnetic 
field’s effects. Further detailed studies of the 
jets of AGN and their magnetic fields, from 
VLBI scales out to the ends of the jets many 


thousands of parsecs from the central black 
hole, should help to determine whether such a 
transition occurs, and where. m 
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Treatment triumphs 


Astampede of recent clinical studies suggests that we are on the cusp of 
developing well-tolerated, orally delivered drugs that can effectively eradicate 
hepatitis C virus from most, if not all, infected individuals. 


CHARLES M. RICE & MOHSAN SAEED 


he story of hepatitis C began in the 
1970s, when it was recognized that 
something other than hepatitis A or 
hepatitis B infections was causing liver inflam- 
mation following blood transfusions’. In 
1989, the troublemaker was identified as a 
small RNA virus, named hepatitis C (HCV)’. 
Although there are now effective diagnostic 
procedures that allow a safe blood supply in 
most developed countries, intravenous drug 
abuse continues to lead to new infections. An 
estimated 185 million people are chronically 
infected with HCV and are at risk of develop- 
ing life-threatening liver diseases, including 
cirrhosis and cancer’. But a recent series of clin- 
ical trials, reported in the New England Journal 
of. Medicine*'', demonstrate drastic increases in 
the effectiveness of anti- HCV drugs. 
Historically, HCV-infected patients have 


1980s 1989 
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Mystery HCV identified, 

virus diagnostics developed 
~5% 
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2002 


been treated by intravenous injections with 
type I interferons — secreted cellular proteins 
that elicit potent antiviral responses’. The 
success rates for interferon-based regimens 
improved from single digits in the 1970s to 
around 50% by 2002, accomplished by increas- 
ing dose, lengthening treatment, chemically 
stabilizing the interferon (by PEGylation) and 
adding ribavirin, an RNA-nucleoside analogue. 
Ribavirin has poor anti-HCV activity when 
used alone but significantly increased treat- 
ment success when combined with interferon 
(by mechanisms that are still unsettled). How- 
ever, this treatment required a 24- or 48-week 
course and was plagued by awful side effects, 
including nausea, depression and anaemia. 
Hence, the goal remained to develop highly 
effective, orally administered and well-toler- 
ated regimens that work for all patient groups. 

Two enzymes encoded by HCV that are 
essential for viral replication — a serine 
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2014 
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PEG-IFN + PEG-IFN + All-oral 
ribavirin ribavirin + DAA DAA 
~50% The >95% 
cure cure cure 


Figure 1 | HCV trajectory. In the 1980s, mysterious cases of liver inflammation following blood 
transfusions that were not explained by hepatitis A or hepatitis B viral infections were treated using type I 
interferon proteins, with a success rate of around 5%. The cause of these infections was identified in 1989 as 
RNA virus hepatitis C (HCV). The combination of PEGylated interferon (PEG-IFN) and ribavirin, approved 
in 2002, improved cure rates to around 50%. By 2011, drug cocktails containing HCV-specific direct-acting 
antivirals (DAAs) were being used to treat patients, with around 75% cure rates, and recent clinical trials*"’ of 
all-oral, interferon-free, DAA-based regimens have increased treatment success rates to more than 95%. 
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protease (NS3-4A) and an RNA polymerase 
(NS5B) — are attractive drug targets. In the 
2000s, inhibitors of these enzymes and of 
another non-enzymatic but essential HCV 
protein (NS5A), referred to as direct acting 
antivirals (DAAs), emerged as the lead targets 
for HCV drug development. In late 2011, two 
NS3-4A protease inhibitors were approved for 
human use in combination with PEGylated 
interferon and ribavirin, raising treatment suc- 
cess to more than 70% for patients with HCV 
genotype 1 (there are six highly divergent and 
variable genotypes of the virus). 

However, euphoria over this advance was 
short-lived. Patients with advanced disease 
were treated but many others were not, owing 
to the additional, often severe, side effects of 
this drug combination and the emergence of 
viral resistance. In the meantime, and continu- 
ing into the present, dozens of new compounds 
were being tested in the clinic. In 2013, more- 
potent DAAs, in combination with PEGylated 
interferon and ribavirin, were approved, as was 
the first all-oral regimen, consisting of a NS5B- 
targeting DAA combined with ribavirin alone. 

The recent clinical studies*’ present the 
next wave of interferon-free, all-oral, DAA- 
based regimens, which are likely to be approved 
in the near future for HCV treatment. Without 
delving into details and trade names, several 
key points about these trials emerge. First, they 
include multiple all-oral combinations that can 
achieve success rates of more than 95%. ‘Suc- 
cess’ for HCV treatment means no detectable 
virus 12 weeks after stopping treatment. Unlike 
drug treatments for hepatitis B and HIV, most 
HCV researchers believe that this endpoint 
represents a durable cure that lowers the risk 
of progressive liver disease. Second, these 
treatments are effective in patients who are 
in greatest need and are most difficult to treat 
— those with advanced fibrosis and cirrhosis, 
those who are co-infected with HIV, and even 
liver-transplant candidates and recipients. Also 
noteworthy is that the new drug combinations 
promise shorter treatment times (12 weeks and 
possibly even less) and minimal side effects; as a 
result, fewer people are expected to discontinue 
their treatment. 

So from a mystery virus and a 5% treatment- 
success rate, we have come to an era of cure 
rates of more than 95% (Fig. 1). Game over, 
right? Not quite. What about viral resistance 
to the drugs? With nearly 200 million infected 
individuals, 6 diverse viral genotypes and 
around | trillion viral variants being gener- 
ated per day per infected person, it is likely 
that HCV will have some tricks up its sleeve to 
develop resistance. However, some of the new 
DAAs, in particular sofosbuvir, which targets 
the active site of NS5B, have an extremely 
high barrier to resistance, and there have 
been only rare glimpses of resistant variants in 
clinical observations with multiple viral geno- 
types’. Combining potent DAAs, each with 
lower resistance barriers, can still be highly 
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effective at avoiding the build-up of resistance. 
Nonetheless, resistance will undoubtedly 
occur and should be taken into account to 
guide treatment decisions. The current drugs 
are also less effective against genotype 3 HCV, 
which is common in South Asia, although pan- 
genotype drugs are in development. 

Another barrier is identifying those 
infected. Most people are unaware of their 
HCV infection“, and only a small minority has 
been treated’*. Although some health agen- 
cies have recommended universal screening 
of high-risk groups, implementing such poli- 
cies is challenging and time-consuming. And 
once infected individuals are identified, how 
will society pay for their treatment? The cur- 
rent price tag for cutting-edge HCV treatment 
in the United States is more than US$80,000 
for a 12-week course. Competition among 
pharmaceutical companies may lower this 
price, but most people infected with HCV live 
in countries that cannot afford the new treat- 
ments. Fortunately, there is movement in the 
pharmaceutical industry to provide for low- 
cost drug production in certain countries, such 
as Egypt, where an estimated 10% of the pop- 
ulation is infected. Finally, getting rid of the 
virus does not always erase the risk of future 
liver-related problems — patients still need to 
be monitored routinely for liver function and 
cancer, particularly those whose infection had 
led to cirrhosis. 

With the new drugs that are in hand or on 
the horizon, we have the means to eradicate 
this virus, possibly without needing a vaccine. 
However, the challenge now is to extend these 


NEUROLOGICAL DISORDERS 


great medical advances on a national and 
global scale to those in need — something that 
has not been terribly effective in the past. We 
can hope that implementing these transforma- 
tive HCV advances will help to create a model 
for success, for this and other widespread 
human diseases. = 
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Quality-control 
pathway unlocked 


A modified ubiquitin protein has been identified by three independent studies 
as the missing link in a cellular quality-control pathway that is implicated in 


Parkinson’s disease. SEE LETTER P.162 


ASA ABELIOVICH 


degenerative disorder, has long been 

hypothesized to be caused by defects in 
organelles called mitochondria, which power 
mammalian cells through the production 
of ATP molecules. An accumulation of dys- 
functional mitochondria may lead not only 
to a cellular energy crisis, but also to excessive 
production of toxic by-products. Two enzymes 
implicated in Parkinson's disease, PINK1 and 
parkin'”, are thought to be involved in the 
disposal of defective mitochondria, but how 


Pp arkinson’s disease, a progressive neuro- 
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the two proteins interact has been unclear. A 
trio of studies (by Kane et al.’, writing in the 
Journal of Cell Biology; by Kazlauskaite et al.’, 
in the Biochemical Journal; and by Koyano 
et al.”, on page 162 of this issue) now report 
that phosphorylated ubiquitin protein is the 
link between PINK1 and parkin, provid- 
ing insights into a complex system of parkin 
regulation. 

Kinase enzymes such as PINK] alter the 
behaviour of target proteins through the addi- 
tion of phosphate groups, a process called 
phosphorylation. PINK] is imported to mito- 
chondria and, in healthy cells, undergoes 


rapid degradation’. However, if mitochondria 
are defective or damaged (for example by 
exposure to CCCP, a poison that blocks ATP 
production), PINK1 accumulates, becoming 
anchored to the outer mitochondrial mem- 
brane with its kinase domain exposed to the 
cytoplasm. 

Damaged mitochondria also attract par- 
kin, which is otherwise dispersed throughout 
the cytoplasm in healthy cells’. Parkin is a 
ubiquitin ligase, which adds ubiquitin pro- 
teins (either singly or in polyubiquitin chains) 
both to itself through autoubiquitination and 
to nearby target proteins. Ubiquitinated pro- 
teins can serve as a signal to the cell that a cel- 
lular compartment should be degraded, which 
in damaged mitochondria leads to their timely 
disposal’, a process known as mitophagy. 

Mutations in either PINK1 or PARKIN 
that underlie rare familial forms of Parkin- 
son’s disease disrupt mitophagy, implicating 
this cellular pathway in Parkinson’s disease’. 
Furthermore, PINK1 mutations impede the 
recruitment of parkin to damaged mitochon- 
dria, suggesting that the proteins act in a linear 
pathway. Consistent with a PINK1-parkin 
quality-control pathway, mutations in pink1 
or parkin in fruit flies cause accumula- 
tion of defective mitochondria and cellular 
degeneration*”. 

Initial models proposed that PINK1 phos- 
phorylates and so activates parkin in damaged 
mitochondria. Although direct phospho- 
rylation of parkin by PINK] has been docu- 
mented”, this modification does not seem to 
be sufficient for full activation of parkin’s ubiq- 
uitin-ligase activity’ >"”. In search of a func- 
tional connection between PINK] and parkin, 
three groups undertook cell-wide protein 
analyses and biochemical studies, and found 
the missing link between the two — phospho- 
rylated ubiquitin (phospho-ubiquitin). 

Each study showed that, in cells in which 
PINK1 was activated by CCCP treatment, 
PINK1 phosphorylates ubiquitin at a serine 
amino-acid residue (serine 65). Strikingly, a 
corresponding serine-65 residue in a ubiqui- 
tin-like domain is the aforementioned target 
of PINK1 phosphorylation on parkin”®. Sub- 
sequent analyses by all three groups dem- 
onstrated that modified ubiquitin, in turn, 
induces parkin activity (Fig. 1). 

Koyano and co-workers found that modi- 
fied ubiquitin alone could not fully activate 
parkin — complete activation required coin- 
cident modification of parkin’s ubiquitin-like 
domain as well as of ubiquitin, each at their 
respective serine-65 residues. A unique aspect 
of this group’s work is their use of a strain of 
yeast that harbours a mutant form of ubiquitin 
lacking the serine-65 residue, which cannot be 
phosphorylated by PINK1. When the authors 
added human PINK1 and parkin to these 
cells, they found that parkin was not activated, 
underscoring the idea of an ordered pathway 
for mitophagy. 
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Figure 1 | PINK] and parkin in mitochondrial quality control. Mitochondrial damage leads to 
anchoring of the PINK] enzyme to the outer mitochondrial membrane, with its kinase domain facing the 
cytoplasm. PINK] adds a phosphate group (P) to the ubiquitin-like domain (Ubl) of the ubiquitin-ligase 
enzyme parkin. Three studies*~ find that PINK1 also phosphorylates the ubiquitin (Ub) protein itself. 
Phosphorylated ubiquitin directly binds to and activates parkin. Activated parkin ligates ubiquitin and 
phospho-ubiquitin molecules to nearby target proteins, leading to disposal of the damaged mitochondria 


through mitophagy. 


Whereas all three studies implicate phos- 
phorylated ubiquitin as an intermediary in 
the PINK 1-parkin pathway, the role of direct 
phosphorylation of parkin by PINK1 seems 
more complex. Koyano and colleagues report 
that modification of both ubiquitin and parkin 
at serine-65 is necessary for full activation of 
parkin in cells. But Kane and colleagues found 
evidence that modification of ubiquitin alone 
can activate parkin. This discrepancy is likely 
to relate to the distinct assays used in the stud- 
ies, rather than to a biological difference. 

Consistent with phospho-ubiquitin's activat- 
ing role, Kane et al. and Koyano et al. found 
that it binds directly to parkin. Koyano and 
colleagues took the studies a step further, dem- 
onstrating that phospho-ubiquitin can still be 
used by parkin as a substrate for ubiquitina- 
tion and autoubiquitination. But, surprisingly, 
the group found that parkin could be activated 
by phospho-ubiquitin that was mutated or 
modified such that it could not act directly 
as a substrate in ubiquitination. This implies 
that phospho-ubiquitin binds to and activates 
parkin separately from its role as a substrate. 

Clues as to how this could be achieved 
might be gleaned from recent crystallographic 
analyses of parkin’”'. A phospho-peptide 
binding pocket has been proposed" to lie 
within an inhibitory domain in parkin that, 
when the protein is inactive, occludes access 
to its catalytic active site. Kazlauskaite et al. 
speculate that the active site of parkin could be 
exposed by conformational changes brought 
about by the binding of phospho-ubiquitin’s 
phosphate group to this inhibitory domain. 

Kane and co-workers’ data point to another 
role for phospho-ubiquitin — recruiting par- 
kin to the outer membrane of damaged mito- 
chondria. A particularly interesting idea is 
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that such recruitment may generate a positive 
feedback loop, in which recruited parkin would 
be predicted to ligate additional phospho- 
ubiquitin to nearby proteins, attracting yet 
more parkin. 

A subset of known parkin substrates, includ- 
ing the proteins mitofusin 2 and Miro, regulate 
mitochondria!*", and their ubiquitination by 
parkin may be required for normal mitophagy. 
It will be important to determine whether acti- 
vation by phospho-ubiquitin affects parkin’s 
target selection, the fate of ubiquitinated tar- 
get proteins, or the structure of polyubiquitin 
chains formed on targets. Finally, drugs that 
mimic the effects of phospho-ubiquitin may 
be candidate therapeutics for inherited and 
sporadic forms of Parkinson's disease. m 


Asa Abeliovich is in the Departments of 
Pathology, Cell Biology and Neurology, 

and at the Taub Institute, Columbia 
University, New York, New York 10032, USA. 
e-mail: aa900@columbia.edu 


1. Valente, E. M. et al. Science 304, 1158-1160 
(2004). 

2. Kitada, T. et al. Nature 392, 605-608 (1998). 

3. Kane, L.A. etal. J. Cell Biol. 205, 143-153 (2014). 

4. Kazlauskaite, A. et al. Biochem. J. 460, 127-139 
(2014). 

5. Koyano, F. et al. Nature 510, 162-166 (2014). 

6. Jin, S.M. et al. J. Cell Biol. 191, 933-942 (2010). 

7. Narendra, D. P. et al. PLoS Biol. 8, e1000298 
(2010). 

8. Clark, |. E. et al. Nature 441, 1162-1166 (2006). 

9. Park, J. et al. Nature 441, 1157-1161 (2006). 

10.Shiba-Fukushima, K. et a/. Sci. Rep. 2, 1002 
(2012). 

11.Wauer, T. & Komander, D. EMBO J. 32, 2099-2112 
(2013). 

12.Trempe, J. F. et al. Science 340, 1451-1455 (2013). 

13.Poole, A. C. et al. Proc. Natl Acad. Sci. USA 105, 
1638-1643 (2008). 

14.Ziviani, E., Tao, R. N. & Whitworth, A. J. Proc. Nat! 
Acad. Sci. USA 107, 5018-5023 (2010). 


5 JUNE 2014 | VOL 510 | NATURE | 45 


NatuULreiNnsiGHT 


LIPIDS IN HEALTH AND DISEASE 


nature iNsiGHT 


Cover illustration 
Jasiek Krzysztofiak 


Editor, Nature 
Philip Campbell 
Publishing 
Richard Hughes 
Production Editor 
Jenny Rooke 

Art Editor 

Nik Spencer 
Sponsorship 
Reya Silao 
Production 

lan Pope 
Marketing 

Elena Woodstock 
Steven Hurst 


Editorial Assistant 
Abbie Williams 


The Macmillan Building 

4 Crinan Street 

London N1 9XW, UK 

Tel: +44 (0) 20 7833 4000 
e: nature@nature.com 


& 


nature publishing group 


5 June 2014 / Vol1510 / Issue No 7503 


out a multitude of crucial roles. For example, 

they act as structural elements in biological 
membranes, they store energy and they function as 
signalling molecules in cellular response pathways. 
Disruption to their levels of expression — as occurs 
in obesity, diabetes, autoimmunity or inborn errors of 
lipid metabolism — leads to dysfunction and disease in 
many organs. 

In this Insight, Joost Holthuis and Anant Menon 
begin by discussing the cellular pathways by 
which eukaryotic cells maintain a variety of lipid 
compositions in different organelles so that they can 
carry out their specialized tasks. 

Next, Michael Maceyka and Sarah Spiegel discuss 
the molecular mechanisms by which the metabolites 
of sphingolipids exert their cell signalling and 
pathological functions. 

Sphingolipid lysosomal storage disorders 
are characterized by the accumulation of 
glycosphingolipids in the late endocytic system. 
Frances Platt discusses insights into the cell biology 
of glycosphingolipid storage disorders and the 
development of treatments. 

Vivian Peirce and colleagues then go on to highlight 
the biology of various types of adipose tissues, and 
their therapeutic potential in metabolic disease. 

The Perspective by Gerald Shulman and colleagues 
discusses the roles of hepatic lipids in non-alcoholic 
fatty liver disease, which is now the most common liver 
disorder in the Western world. 

Finally, Charles Serhan reviews the anti- 
inflammatory and immunoregulatory actions of 
omega-3-derived lipid resolving mediators and their 
therapeutic potential. 
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Lipid landscapes and pipelines 
in membrane homeostasis 


Joost C.M. Holthuis'? & Anant K. Menon? 


The lipid composition of cellular organelles is tailored to suit their specialized tasks. A fundamental transition in the lipid 
landscape divides the secretory pathway in early and late membrane territories, allowing an adaptation from biogenic to 
barrier functions. Defending the contrasting features of these territories against erosion by vesicular traffic poses a major 
logistical problem. To this end, cells evolved a network of lipid composition sensors and pipelines along which lipids are 
moved by non-vesicular mechanisms. We review recent insights into the molecular basis of this regulatory network and 
consider examples in which malfunction of its components leads to system failure and disease. 


ciate in water is the physical basis for spontaneous membrane 

formation, and renders membranes virtually impermeable to 
polar solutes. This fundamental principle enabled the first cells to 
segregate their internal constituents from the external environment 
and is repeated within nucleated cells to form discrete organelles. 
Whereas a single lipid species is sufficient to generate a barrier to 
unregulated solute movement, membrane lipids in eukaryotic cells 
are characterized by a remarkable structural diversity'”. In addition, 
the lipid composition and transbilayer arrangement among organelles 
show striking variations, and there is compelling evidence that the 
collective properties of the bulk lipids play a profound part in defin- 
ing organelle identity and function. 

Of particular interest are the lipid-induced changes in physical 
membrane properties such as bilayer thickness, lipid packing density 
and surface charge that mark the transition from early to late orga- 
nelles in the secretory pathway”. These alterations are conserved 
from yeast to human and provide specific cues for membrane proteins 
to allow temporal and spatial control over the molecular events that 
govern secretion. This requires not only elaborate metabolic machin- 
ery for creating a diverse repertoire of membrane lipids, but also ways 
to preserve the unique lipid mixtures of early and late secretory orga- 
nelles. Without mechanisms imposing selectivity and directionality 
on lipid movement, organelle identity and function would vanish in 
the face of extensive vesicular trafficking. 

In this Review, we discuss how cells coordinate lipid metabolism 
and transport to achieve membrane lipid homeostasis in organelles 
connected by vesicular trafficking. As the underlying molecular prin- 
ciples involve collective phenomena beyond specific interactions 
among proteins and lipids, we first describe how lipid composition 
influences the generic physical properties of the membrane and how 
these properties change from early to late secretory organelles. We 
discuss how these changes are monitored by membrane sensors that 
relay signals to the lipid metabolic machinery when organellar lipid 
homeostasis is challenged. We then focus on the long-standing enigma 
that lipid traffic in cells is largely independent of membrane traffic. 
Current models suggest that lipid traffic is mediated by cytoplasmic 
transfer proteins that operate at contact sites between early and late 
secretory organelles. An appealing concept is that these so-called lipid 
pipelines’ are exploited to bypass vesicular connections for optimal 
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control over lipid exchange between secretory organelles. Finally, we 
discuss mounting evidence of extensive co-regulation among individ- 
ual lipid classes and how perturbation of the underlying homeostatic 
machinery can culminate in system failure and disease. 


Lipid repertoire of the secretory pathway 

The organelles along the secretory pathway have major differences in 
lipid composition that help to shape their specialized tasks. Whereas 
some rare lipids (for example, phosphoinositides) contribute to orga- 
nelle identity by mediating specific molecular recognition events, 
we focus on how the collective action of the bulk lipids specify orga- 
nelle function by influencing generic physical membrane parameters 
such as fluidity, thickness, lipid packing density and surface charge 
(Fig. 1a). For proper organelle function, these parameters must be 
kept within an appropriate range. To better understand the underly- 
ing principles, we first discuss some physico-chemical properties of 
the main categories of membrane lipids found along the secretory 
pathway — phospholipids, sphingolipids and sterols. 

Phospholipids constitute the bulk of the membrane’s lipid matrix. 
They contain two hydrophobic acyl chains and a phosphate head 
group ester linked to a glycerol backbone. Further classification is 
based on the structure of the phospholipid head group, which is 
zwitterionic in phosphatidylcholine (PtdCho) and phosphatidyle- 
thanolamine (PtdEtn), or anionic in phosphatidylserine (PtdSer) and 
phosphatidylinositol (PtdIns). Each phospholipid class comprises a 
multitude of molecular species defined by the length and degree of 
saturation in their acyl chains. Both head group and acyl chain com- 
position influence the physical properties of the membrane®. PtdCho 
accounts for more than 50% of all phospholipids in most eukaryotes 
and usually contains one cis-unsaturated acyl chain, such as oleic 
acid (C18:1). The rigid kink of the cis-double bond lowers the pack- 
ing density of the acyl chains, which increases membrane fluidity’. 
Facilitated by its cylindrical shape, PtdCho self-assembles spontane- 
ously into closed bilayers that adopt a liquid crystalline state at physi- 
ological temperature. Thus, PtdCho is ideally suited to providing a 
stable and fluid matrix for cellular membranes, a pre-condition for 
the living state. By contrast, PtdEtn is a conically shaped phospholipid 
that imposes negative curvature stress on the membrane owing to 
its relatively small polar head group. The non-bilayer propensity of 
PtdEtn increases with acyl chain unsaturation, creating lipid-packing 
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Figure 1 | Membranes of early and late secretory organelles have 
contrasting lipid compositions and physical properties. a, Physical 
membrane properties are influenced by lipid composition. Fluidity is promoted 
by lipids with short, unsaturated fatty acids. The double bonds introduce kinks 
that lower the packing density of the acyl chains and inhibit transition of the 
membrane from a fluid to a solid gel phase. Phospholipids typically pair a 
saturated with an unsaturated fatty acid, which is an elegant means to prevent 
the two types of acyl chains from phase separating when the temperature 
drops. Thickness is promoted by acyl-chain length and sterols, which order 
and stretch the acyl chains. High levels of packing defects are found in 

lipids with unsaturated acyl chains and small head groups. Surface charge is 
determined by the presence of anionic lipids such as phosphatidylserine and 
phosphoinositides. Curvature is determined by lipid shape. Lipids with a small 
area ratio of polar head to acyl chain (creating a cone shape) induce negative 
curvature, lipids with an equal head to chain ratio (creating a cylinder shape) 


defects that facilitate membrane fusion and influence the binding and 
activity of peripheral membrane proteins*”. Despite their relatively 
low abundance, PtdSer and PtdIns are key determinants of membrane 
surface charge and mediate functional interactions with positively 
charged regions of peripheral and integral membrane proteins'”"’. 
Distinct from phospholipids, sphingolipids primarily contain satu- 
rated or trans-unsaturated acyl chains linked to a serine backbone. 
This hydrophobic structure, termed ceramide, is decorated with a 
phosphocholine head group in the case of sphingomyelin or with 
saccharides in the case of glycosphingolipids. Owing to its acyl-chain 
composition, sphingomyelin forms a taller, narrower cylinder than 
PtdCho, increasing its packing density in the membrane. Conse- 
quently, at physiological temperatures a sphingomyelin bilayer exists 
in a solid gel phase with tightly packed, immobile acyl chains'’*""*. 
However, the addition of sterols renders these membranes fluid”. 
Sterols are abundant apolar membrane lipids with an inflexible core 
formed by four fused rings, with cholesterol predominating in mam- 
mals. By interfering with acyl-chain packing, sterols inhibit the tran- 
sition of the membrane to the solid gel state. At the same time, sterols 
rigidify fluid membranes by reducing the flexibility of neighbouring 
unsaturated acyl chains, thereby increasing membrane thickness 
and impermeability to solutes (the so-called condensing effect of 
sterols)'*'°. Thus, sterols allow cells to minimize unregulated solute 
movement across their membranes while keeping them fluid over a 
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are neutral, and those with a much larger head compared with the acyl chain 
area (creating an inverted cone shape) induce a positive curvature. PtdCho, 
phosphatidylcholine; PtdEtn, phosphatidylethanolamine; SM, sphingomyelin. 
b, The endoplasmic reticulum (ER) has a thin bilayer, loose lipid packing and 
neutral cytoplasmic surface charge adapted for its biogenic function. The 
plasma membrane (PM) has a thick bilayer, tight lipid packing and negative 
cytoplasmic surface charge adapted for its barrier function. These contrasting 
physical properties are reflected in the length (20 aa and 25 aa, respectively) and 
geometry of transmembrane domains (TMDs) of ER and plasma-membrane- 
resident proteins. Membrane traffic between the ER and plasma membrane 
passes through the Golgi, a polarized and multi-cisternal organelle in which 
lipid sorting must occur to preserve the unique lipid compositions and, hence, 
specialized functions of the ER and plasma membrane. Note that lipid traffic 
can bypass membrane traffic at sites at which the ER contacts the trans-Golgi, 
plasma membrane or mitochondria. 


broad range of acyl-chain compositions and temperatures. 

Restriction of the membrane lipid repertoire of eukaryotic cells is 
not well tolerated. Attempts to manipulate the ratio between cylin- 
drical and conical lipids indicate that this ratio is tightly controlled 
through compensatory mechanisms”. Cells defective in PtdSer 
biosynthesis preserve membrane electrostatics by upregulating the 
PtdIns content of their membranes”. The requirement of sphingolip- 
ids for cell growth in yeast can be bypassed by a suppressor mutation 
that enables the production ofa novel set of phospholipids, the struc- 
tural and physico-chemical properties of which mimic those of the 
sphingolipids’’. We discuss how eukaryotic cells exploit a structurally 
diverse set of membrane lipids to sustain the advanced specialization 
of their secretory apparatus. 


Lipid composition subserves secretory organelle function 

The endoplasmic reticulum (ER) is the gateway of the secretory 
pathway and the principal site for membrane protein insertion and 
folding”. It is also the cell’s main lipid factory, in which the bulk of 
phospholipids and sterols, as well as substantial amounts of storage 
lipids such as triacylglycerol and stery] esters, are produced (Fig. 2). 
In addition, the ER synthesizes ceramide, the precursor of all sphin- 
golipids. Besides exporting ceramide, the ER supplies a large portion 
of membrane lipids to the Golgi and plasma membrane because these 
distal secretory organelles have little or no capacity to produce their 
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Figure 2 | Membrane lipid biosynthesis. Principal pathways for the 
production of sterols, sphingolipids and phospholipids in mammals, and the 
key biosynthetic enzymes (pale green) and rate-limiting enzymes (green) 
involved. AGPAT, 1-acylglycerol-3-phosphate-O-acyltransferase; CCT, 
CTP:phosphocholine cytidylyltransferase; CDP-Cho, cytidinediphosphate 
choline; CDP-DAG, cytidine diphosphate-diacylglycerol; CDP-Eth, cytidine 
diphosphate ethanolamine; CDS, cytidine diphosphate diacylglycerol 
synthase; CEPT, choline/ethanolaminephosphotransferase; CERS, ceramide 
synthase; Cho, choline; CL, cardiolipin; CLS, cardiolipin synthase; CPE, 
ceramide phosphoethanolamine; CPT, cholinephosphotransferase; 


own. Despite an extensive exchange of material by membrane traf- 
ficking, the ER and plasma membrane show remarkable differences 
in their lipid composition (Fig. 1b). For instance, sterols are rare 
in the ER (5 mol% of lipids) but abundant in the trans-Golgi and 
plasma membrane (30-40 mol%)'*”". Sterols preferentially interact 
with lipids bearing saturated fatty acyl chains and bulky head groups, 
such as sphingolipids’’. The bulk of sphingolipids are synthesized in 
the lumen of the trans-Golgi from ceramide supplied by the ER”. 
Analogous to sterols, sphingolipids accumulate in the plasma mem- 
brane while their levels are low in the ER. Moreover, phospholipids 
are more saturated at the plasma membrane than at the ER” owing 
to a substantial remodelling of their acyl chains®. Thus, the plasma 
membrane is characterized by tight lipid packing due to a high con- 
centration of saturated lipid species and sterols, making it a rigid and 
thick barrier between the intracellular and external milieu. 

The transition from a thin and loosely packed membrane into 
a thick and rigid one occurs in the trans-Golgi and is driven by 
sphingolipid production and sterol supply” (discussed later). The 
mechanism by which sphingolipids and sterols are moved up a 
concentration gradient into the plasma membrane is incompletely 
understood, but their preferential interactions and co-depletion from 
the highly curved coat protein complex 1 (COPI) vesicles that medi- 
ate retrograde membrane trafficking might have a role”**”*. In any 
case, the step-change in membrane thickness caused by the abrupt 
increase in the concentration of sterols and sphingolipids is reflected 
in the length and composition of transmembrane domains (TMDs) 
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FAS, fatty acid synthase; GCS, glucosylceramide synthase; HMGCS, 
hydroxymethylglutaryl coenzyme A (CoA) synthase; HMGCR, 3-hydroxy-3- 
methylglutaryl-CoA reductase; Ins, inositol; PA, phosphatidic acid; PAP (also 
known as LIPIN), phosphatidic acid phosphatase; P-Cho, phosphocholine; 
PG, phosphatidylglycerol; PGP, phosphatidylglycerolphosphate; 

PIP, phosphoinositide; PtdCho, phosphatidylcholine; PtdEtn, 
phosphatidylethanolamine; PtdInsS, phosphatidylinositol synthase; PtdSer, 
phosphatidylserine; SMS, sphingomyelin synthase; SMSr, sphingomyelin 
synthase-related enzyme; SPT, serine palmitoyltransferase; TAG/CE, 
triacylglycerol/cholesterol ester. 


of single-spanning membrane proteins that populate the secretory 
pathway. TMDs of ER and Golgi-resident proteins are significantly 
shorter than those of plasma-membrane proteins and have more 
bulky residues in the exoplasmic hemi-TMD’ (Fig. 1b). Both TMD 
length and volume are crucial determinants of Golgi retention or exit 
to the plasma membrane”. Selective associations between match- 
ing lipids and TMDs can drive their co-segregation in model mem- 
branes”. This implies that the Golgi exploits physical principles such 
as hydrophobic mismatch and packing density to solve a complex 
logistical problem, namely to preserve the unique mixtures of mem- 
brane proteins and lipids that allow the ER and plasma membrane to 
execute their fundamentally distinct tasks. 

Recent work indicates that the Golgi also marks a transition 
point in membrane electrostatics. Membrane surface charge largely 
depends on the content of anionic phospholipids such as PtdSer, the 
levels of which range from a few per cent in the ER to more than 10% 
at the plasma membrane”. Studies with genetically encoded PtdSer 
probes indicate that the bulk of PtdSer resides in the luminal leaflet 
of the ER, an asymmetric distribution opposite to that of the plasma 
membrane in which PtdSer faces the cytoplasm”. Bulk phospholip- 
ids, including PtdSer, are synthesized in the cytoplasmic leaflet of 
the ER and then randomly ‘scrambled’ by a bidirectional flippase to 
allow membrane expansion”. This suggests that PtdSer is trapped 
in the ER lumen, possibly through calcium-mediated interactions 
with luminal proteins. PtdSer asymmetry in the ER is consistent with 
the finding that the phospholipid monolayer around lipid droplets, 


© 2014 Macmillan Publishers Limited. All rights reserved 


which is thought to arise from the cytoplasmic leaflet of the ER, con- 
tains hardly any PtdSer*’. Most of the Golgi does not react with the 
cytoplasmic PtdSer probe, except for the trans-Golgi, which contains 
type 4 P-type ATPases (P,-ATPases) that translocate PtdSer and other 
phospholipids from the luminal to the cytoplasmic leaflet”. Also of 
note is experimental evidence that PtdIns is not equally distributed 
over the ER but concentrated in highly dynamic sub-compartments™, 
suggesting that the cytoplasmic surface of the ER is overall relatively 
neutral. This is in sharp contrast to the cytoplasmic surface of the 
trans-Golgi and plasma membrane, at which PtdSer and phospho- 
inositides contribute to negative charge. 

Thus, the Golgi defines a demarcation line between two broad 
membrane territories with distinct physical and functional features**. 
One is the territory of thin bilayers, loose lipid packing and neutral 
cytoplasmic surface charge, which is adapted for biogenic functions 
and comprises the ER and cis-Golgi. The other is the territory of 
thick bilayers, tight lipid packing and negative cytoplasmic surface 
charge, adapted for barrier functions and comprising the trans-Golgi 
and plasma membrane. The contrasting features of these territories 
provide specific cues for both integral and peripheral membrane pro- 
teins, allowing temporal and spatial control over the key events that 
occur along the secretory pathway. For instance, late secretory orga- 
nelles recruit peripheral membrane proteins that contain polybasic 
motifs'*"". Early secretory organelles, on the other hand, bind pro- 
teins with neutral amphipathic lipid packing sensor (ALPS) motifs, 
containing bulky hydrophobic residues that readily insert into lipid- 
packing defects***. We discuss two basic mechanisms by which cells 
preserve the unique lipid compositions of early and late membrane 
territories. One depends on proteins that are capable of sensing alter- 
ations in lipid composition by monitoring physical properties of the 
membrane. The other uses pipelines to bypass vesicular connections 
for optimal control over lipid exchange between the two territories. 


Lipid composition sensors 

The ER is the principal supplier of bulk lipids to other organelles. 
Its extensive network of tubular and planar membranes interfaces 
at contact sites with both secretory (for example, plasma membrane 
or trans-Golgi) and non-secretory organelles (for example, mito- 
chondria or lipid droplets) to facilitate control over lipid transport 
and metabolism in response to varying cellular demands**””. The 
ER is also exceptionally sensitive to perturbations in its unique lipid 
composition and biophysical properties. For instance, an imbalance 
between saturated and unsaturated phospholipids readily affects 
ER biogenic activity, inducing a stress response that can trigger cell 
death**””. Interestingly, the biogenic membranes of some bacteria 
harbour thermosensors that control phospholipid desaturation. Of 
these, the thermosensor DesK from Bacillus subtilis is mechanisti- 
cally the best understood. DesK seems to sense the thickening of 
the membrane as the temperature drops (Fig. 3a). This is thought 
to force a cluster of polar residues into a more apolar environment, 
resulting in a conformational change that triggers the autokinase 
activity of DesK*’. Subsequently, phosphorylated DesK activates 
DesR, a transcriptional activator ofa fatty acid desaturase. Promotion 
of membrane fluidity by the latter switches DesK from a kinase to a 
phosphatase, shutting down expression of the desaturase. 

How the ER maintains an optimal level of phospholipid desatura- 
tion remains to be established, but there is increasing evidence that 
the underlying mechanisms are interconnected with the organelle’s 
protein quality control system, which is called the unfolded protein 
response (UPR)**’. The integral membrane protein IRE1 is an ER 
stress sensor that detects misfolded proteins in the ER lumen and 
activates UPR. This response activates transcription of lipid biosyn- 
thetic genes, allowing ER membrane expansion to counterbalance the 
protein-folding stress in its lumen“. Conversely, an excess of satu- 
rated fatty acids and other perturbations in phospholipid biosynthesis 
also activates the UPR®™. Intriguingly, IRE1 that lacks the luminal 
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unfolded protein-sensing domain retains the ability to activate the 
UPR by lipid stress**, suggesting a dual role for IRE1 as membrane 
sensor of lipid perturbations in the ER (Fig. 3b). 

Sterol excess blocks protein import into the ER”, whereas sterol 
depletion results in mistargeting of tail-anchored mitochondrial 
proteins to the ER“. Therefore, sterol levels in the ER of mammals 
are kept within narrow limits by a sophisticated feedback system 
that revolves around the membrane-bound transcription factor 
SREBP2 and the sterol-sensing protein SCAP. When ER sterol levels 
drop below 5 mol%, SREBP2 is escorted from the ER to the Golgi 
by SCAP”. In the Golgi, SREBP2 undergoes proteolytic cleavage to 
release a transcription factor that activates genes involved in sterol 
biosynthesis and uptake. Above the 5 mol% threshold, sterol-bound 
SCAP and SREBP2 remain in the ER, preventing SREBP2 process- 
ing. Thus, the SCAP-SREBP2-mediated feedback loop ensures a 
constant supply of ER-derived sterols to distal organelles within a 
range compatible with the biogenic function of the ER”. A second 
SREBP isoform, SREBP1, is part of an independent feedback loop that 
specifically responds to changes in PtdCho levels’. PtdCho deple- 
tion causes a redistribution of the SREBP-activating proteases from 
the Golgi to the ER, followed by proteolytic activation of SREBP1 
and the induction of fatty-acid-biosynthetic genes. Apparently, the 
enhanced negative curvature stress associated with increasing the 
PtdEtn/PtdCho ratio affects curvature sensors that control COPI- 
mediated retrograde trafficking, causing a shift in the distribution 
of the SREBP-activating proteases towards the ER”. 

Because PtdCho is a major membrane lipid, its production is a 
crucial point of control in global membrane homeostasis. The rate- 
limiting step in PtdCho biosynthesis is the formation of cytidine 
diphosphate-choline, a reaction catalysed by cytidine triphosphate 
(CTP):phosphocholine cytidylyltransferase (CCT; Fig. 2). CCT con- 
tains an amphipathic helix — called domain M — that silences the 
activity of the enzyme in its soluble form”. In the presence of PtdCho 
bilayers with a high content of conical lipids such as diacylglycerol or 
PtdEtn, domain M undergoes a conformational switch that creates a 
hydrophobic face for membrane binding*'. Association with mem- 
branes relieves self-inhibition and drastically increases the affinity of 
CCT for its substrate CTP (Fig. 3c). By contrast, membrane associa- 
tion and activity of CCT is reduced when diacylglycerol/PtdCho or 
PtdEtn/PtdCho ratios are low. Thus, CCT seems to act as a general 
sensor of lipid packing defects that signals a demand for PtdCho 
biosynthesis to allow membrane expansion or to prevent its transi- 
tion into a porous state. Consistent with this model, CCT is recruited 
to lipid droplets in the growing phase to allow production of more 
PtdCho molecules to surround the expanding oily core” (Fig. 3c). 


Lipid pipelines 

Newly synthesized lipids are exported from the ER as components 
of secretory vesicles, or through pipelines operated by cytoplasmic 
lipid transfer proteins (LTPs)*””’. The latter mechanism is crucial for 
supplying ER lipids to mitochondria and other organelles that are not 
connected by vesicular trafficking but rely on lipid import for proper 
function™. Intriguingly, bulk transport of various lipids between the 
ER and plasma membrane continues unabated when vesicular traf- 
ficking is shut off***. Thus, lipid pipelines are also an important 
means by which the ER delivers lipids to distal secretory organelles. 
In fact, pipelines run from the ER to the trans-Golgi and plasma 
membrane to ferry sterols**°>”’, ceramide”, PtdCho”, PtdEtn®®”, 
PtdSer® and PtdIns. A pipeline also transports glucosylceramide, 
the precursor of complex glycosphingolipids, to the trans-Golgi™. 
Pipelines require LTPs because spontaneous movement of most lipids 
between membranes occurs too slowly to support cell functions”, 
Beyond their obvious function in the supply of lipids, we propose 
that two other aspects of pipelines are key to membrane homeosta- 
sis. First, they ease lipid sorting and remodelling stress in the Golgi, 
thereby enabling early and late secretory organelles to maintain their 
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Figure 3 | Lipid composition sensors in biogenic organelles. 

a, Thermosensor Desk is a histidine kinase acting at the top of a regulatory 
cascade controlling the synthesis of unsaturated fatty acids in Bacillus 

subtilis. Following a drop in temperature, the membrane spans of the kinase 
sense an increase in bilayer thickness owing to an increased ordering of the 
lipid acyl chains. This promotes a kinase dominant state of DesK, which 
autophosphorylates and then transfers the phosphate group to DesR. Phospho- 
DesR activates transcription of the des gene coding for a A5-desaturase. A5-Des 
desaturates the acyl chains of membrane phospholipids, increasing membrane 
fluidity. This change promotes the phosphatase-dominant state of DesK, leading 
to DesR dephosphorylation and turning off transcription of the des gene. b, 
IRE] is a conserved ER-resident stress sensor that can trigger an unfolded 
protein response (UPR) through a luminal domain that monitors the unfolded 
protein load in the ER. In addition, IRE1 uses its membrane span to detect 
membrane perturbations such as an increase in lipid saturation, which can 
signal UPR independently of the luminal stress-sensing domain. On activation, 
IRE] is autophosphorylated and assembles into high-order oligomers. 


52 | NATURE | VOL 510 | 5 JUNE 2014 


Major packing defects 


>} Inactive 


Restored to minor packing defects 


Signalling proceeds by the non-conventional splicing of XBP1 mRNA, which 
is initiated by IRE1’s cytoplasmic RNase domain. Only the spliced mRNA is 
translated to produce an active transcription activator of UPR genes, which 
code for folding enzymes and lipid biosynthetic machinery, to restore ER 
membrane homeostasis. SFA, saturated fatty acid. c, CTP:phosphocholine 
cytidylyltransferase (CCT), the rate-limiting enzyme in phosphatidylcholine 
(PtdCho) biosynthesis, interconverts between an inactive soluble and active 
membrane bound form in response to changes in membrane lipid packing 
density. An amphipathic helix, called domain M, silences the activity of the 
enzyme in its soluble form. Lipid packing defects arising from increased levels 
of conical lipids (for example, diacylglycerol (DAG), phosphatidylethanolamine 
(PtdEtn)) or by membrane expansion (for example, during lipid droplet (LD) 
growth) are sensed by domain M, creating a hydrophobic face for membrane 
binding. Association of domain M with membranes relieves self-inhibition, 
increasing the affinity of CCT for its substrate CTP. This then enhances 
biosynthesis of PtdCho to prevent transition of the membrane into a porous 
state. C(E)PT, choline/ethanolaminephosphotransferase. 
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compositional identity in the face of continuous vesicular trafficking. 
Second, they provide a rapid-response mechanism to facilitate regula- 
tory crosstalk among the three main lipid classes. After discussing 
how pipelines operate, we consider how they could help to preserve 
the contrasting features of the early and late membrane territories of 
the secretory pathway. 

At first approximation, LTPs are simply lipid exchangers, similar 
to cyclodextrins that bind and exchange hydrophobic compounds 
(Fig. 4a). Many LTPs were first identified by their ability to exchange 
lipids between vesicle populations in the test tube. The exchange 
mechanism can be thought of as a multi-step process. Thus, a par- 
tially desorbed membrane lipid is reversibly taken up into the LTP’s 
lipid binding pocket. A conformational change might then occur, 
causing the binding pocket to be closed off, protecting the lipid from 
the aqueous cytoplasm. The reverse process results in the release of 
lipid at an acceptor membrane. When extending this idea to cells, 
LTP-operated pipelines would function as bidirectional routes of lipid 
transfer, equilibrating lipids between the cytoplasmic faces of orga- 
nelles. As already discussed, the situation is more complex because 
bulk lipid composition varies greatly along the secretory pathway, and 
transport often entails movement against a concentration gradient. 
Several factors must be considered in adapting this simple model of 
LTP-mediated lipid equilibration to the cellular situation. 

First, many LTPs possess specific membrane interaction motifs 
such as the FFAT motif that engages ER receptors™, and a PH domain 
that binds phosphatidylinositol-4-phosphate (PtdIns(4)P) in late 
secretory organelles” (Fig. 4b). The presence of these motifs ensures 
a higher frequency of productive encounters between LTPs and par- 
ticular membranes, ensuring specificity. In addition, LTPs, such as 
CERT®”*”! and oxysterol binding proteins (OSBPs)*, that have both 
motifs can engage two membranes simultaneously (Fig. 4c). These 
LTPs probably operate within the narrow (<30 nm) cytoplasmic gap 
created by membrane contact sites between the ER and trans-Golgi, 
potentially increasing the lipid exchange rate’. Second, transfer 
specificity alone cannot account for the fact that many lipids enter- 
ing an ER pipeline eventually accumulate in an acceptor organelle. 
For this, the cell uses three general strategies: metabolic trapping, 
thermodynamic trapping and heterotypic lipid exchange. Ceramide 
transport provides a good example of metabolic trapping (Fig. 4c). 
CERT exchanges ceramide bidirectionally between the ER and 
trans-Golgi, but conversion of ceramide to sphingomyelin in the lat- 
ter compartment ensures net one-way transport™. Thermodynamic 
trapping underlies a proposed model for sterol movement between 
the ER and the plasma membrane”. In this model, the association of 
sterols with saturated plasma membrane lipids (Fig. 1b) provides a 
thermodynamic trap. More precisely, the chemical activity of sterol 
in the ER and plasma membrane is the same, which is consistent with 
LTP-mediated equilibratory exchange of sterols between these com- 
partments*””, but the chemical activity coefficient for sterols in the 
plasma membrane is significantly lower than that in the ER, enabling 
higher plasma-membrane sterol concentrations”. 

Thermodynamic traps might also help to explain how cells con- 
trol the transbilayer and territorial asymmetry of PtdSer, as already 
discussed. Newly synthesized PtdSer is transported to the plasma 
membrane by secretory vesicles, as well as by a pipeline mechanism 
(Fig. 4d, right panel). PtdSer is carried in the luminal leaflet of secre- 
tory vesicles until it reaches the trans-Golgi at which it is flipped 
to the cytoplasmic leaflet by PtdIns(4)P-dependent P,-ATPases*””. 
Golgi-derived vesicles then move PtdSer to the cytoplasmic face 
of the plasma membrane. Intersecting this delivery mechanism is 
a PtdSer pipeline operated by members of the OSBP family, such 
as Osh6 and Osh7 in yeast” (Fig. 4d, left panel). To maintain the 
cytoplasmically oriented pool of PtdSer in the plasma membrane in 
the presence of a PtdSer pipeline, it is likely that PtdSer is trapped 
by direct electrostatic interactions with the polybasic domains of 
plasma-membrane-localized proteins. 
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Heterotypic lipid exchange involves the reciprocal transfer of two 
different lipids by an LTP, as suggested for PtdIns-PtdCho exchang- 
ers’. Coupled with a metabolic trap for one of the lipids and a ther- 
modynamic trap for the other, this exchange principle can account 
for net unidirectional movement of lipids, as recently shown for the 
transport of PtdIns(4)P and sterols by OSBP and its yeast counter- 
part Osh4 (refs 53, 73-75) (Fig. 4c). OSBP and Osh4 bind both sterol 
and PtdIns(4)P in a mutually exclusive manner. The proteins capture 
sterol in the ER and exchange it for PtdIns(4)P in the trans-Golgi. 
On returning to the ER, the reverse exchange occurs such that the 
LTP discharges PtdIns(4)P and is reloaded with sterol. The potential 
reversibility of this process is eliminated by a metabolic trap, which 
drives anterograde sterol movement. Thus, PtdIns(4)P is hydrolysed 
to PtdIns by an ER-localized phosphatase, ensuring that the LTP — 
which cannot bind PtdIns — departs the ER loaded with sterol. There 
are two further aspects of this model that must be considered. First, a 
thermodynamic trap for sterol at the trans-Golgi is necessary other- 
wise OSBP or Osh4 might not be able to acquire PtdIns(4)P efficiently 
at the trans-Golgi in the face of sterol excess. This requirement is sat- 
isfied by the local production of sphingolipids. Second, PtdIns pools 
must be regenerated at the trans-Golgi to enable fresh PtdIns(4)P 
synthesis. Because PtdIns is not synthesized in the trans-Golgi, it 
seems likely that the PtdIns generated by PtdIns(4)P hydrolysis in the 
ER is recycled to the trans-Golgi by a pipeline operated by PtdIns/ 
PtdCho exchangers such as Sec14 or Nir2 (ref. 37). 


Regulatory crosstalk 
The membrane contact site between the ER and trans-Golgi is a nexus 
for the coordinate regulation of sphingolipid and sterol levels that 
help to define late secretory organelles. The delivery of sterol to the 
trans-Golgi by OSBP in mammals and Osh4 in yeast activates a trans- 
Golgi localized PtdIns-4-kinase, causing an increase in PtdIns(4)P 
levels’’. In mammals, this results in the recruitment of CERT (Fig. 4c) 
and promotes delivery of ceramide for sphingomyelin synthesis to 
set up a thermodynamic trap for sterols. A related process probably 
occurs in yeast, although the ceramide carrier has yet to be identi- 
fied’’. PtdCho is required in the PtdIns/PtdCho exchange process 
that restores Golgi PtdIns levels”, as well as for sphingomyelin bio- 
synthesis (Fig. 2); because PtdCho is a bulk lipid, its high levels in the 
trans-Golgi ensure that these seemingly antagonistic demands are 
easily met. Thus, a number of pipelines intersect to ensure that sphin- 
golipid precursors reach the trans-Golgi in synchrony with sterol 
arrival, allowing a fundamental transition in the lipid landscape that 
divides the secretory pathway in early and late membrane territories. 
In addition to the coordinate regulation of the sphingolipid and 
sterol content of the plasma membrane, a remarkable homeostatic 
circuit regulates plasma membrane lipid organization by linking the 
transbilayer phospholipid asymmetry of the plasma membrane to 
its sphingolipid content’ (Fig. 5). This circuit relies on a number of 
protein kinases, the action of which, in this context, has been best 
studied in yeast. Thus, the TORC2 protein kinase complex responds 
to plasma membrane sphingolipid levels: when sphingolipid lev- 
els are low, TORC2 phosphorylates the Ypk1 and Ypk2 kinases to 
generate two outcomes. The first is an increase in the production 
of sphingolipid precursors in the ER. The rate-limiting step in the 
synthesis of sphingolipid precursors is catalysed by serine palmi- 
toyltransferase (SPT; Lcb1 or Lcb2 in yeast; Fig. 2) and regulated by 
Orm proteins” *". SPT, Orm proteins and the PtdIns(4)P phosphatase 
Sacl form the SPOTS complex in the ER membrane. SPT activity in 
the SPOTS complex is auto-inhibited, but this inhibition is relieved 
on Ypk1- and Ypk2-mediated phosphorylation of the Orm proteins. 
The second result of TORC2-mediated phosphorylation of Ypk1 
and Ypk2 is to phosphorylate and downregulate the flippase kinases 
Fpk1 and Fpk2, which function as flippase activators*”**. Thus, sphin- 
golipid levels in the exoplasmic leaflet of the plasma membrane are 
sensed and corrected by fresh synthesis of precursors. This occurs 
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Figure 4 | Lipid pipelines operated by lipid transfer proteins. a, Simple 
model for LTP-mediated lipid exchange, resulting in the equilibration of lipid 
cargo between donor and acceptor membranes. b, Schematic structure of LTPs, 
such as CERT and OSBP, that carry dual membrane targeting motifs as well 

as a lipid-binding domain. c, Ceramide and sterol pipelines at endoplasmic 
reticulum (ER)-trans-Golgi membrane contact sites in mammals. CERT (left) 
is inactive as a phospho-protein because of reciprocal masking of its PH domain 
and lipid-binding START domain”. On dephosphorylation, it simultaneously 
engages phosphatidylinositol 4-phosphate (PtdIns(4)P) at the trans-Golgi and 
VAP-A at the ER, catalysing movement of ceramide between the cytoplasmic 
faces of the two organelles. Ceramide flips to the luminal side of the trans-Golgi 
at which it is converted to sphingomyelin (SM) by its synthase (SMS). This 
metabolic trapping event enables continued delivery of ceramide from the ER. 
Like CERT, OSBP (right) exists in an inactive form in which its functional motifs 
are masked. Oxysterol binding relieves this autoinhibition enabling OSBP to 
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engage the ER-trans-Golgi contact site. OSBP uses heterotypic lipid exchange 

to move sterol between the ER and trans-Golgi, and PtdIns(4)P in the reverse 
direction. At the ER, PtdIns(4)P is hydrolysed to PtdIns. This is returned to the 
trans-Golgi by a PtdIns—phosphatidylcholine (PtdCho) exchanger (not shown), 
to be converted to PtdIns(4)P by a sterol-activated phosphatidylinositol-4-OH 
kinase (PI(4)K). d, A phosphatidylserine (PtdSer) pipeline operated by Osh6 
and Osh7 in yeast transfers PtdSer synthesized from CDP-diacylglycerol (DAG) 
and serine on the cytoplasmic surface of the ER to the plasma membrane 

(PM) (left). Trapping events mediated by calcium-bound or positively charged 
proteins ensure that, at steady state, PtdSer is mainly localized in the lumen of 
the ER (despite the presence of a bidirectional flippase) and the cytoplasmic 
face of the plasma membrane. PtdSer in the luminal leaflet of the ER enters 
secretory vesicles and is delivered to the trans-Golgi (right). Flippase activity 

of P,-ATPases moves PtdSer to the cytoplasmic face. Thus, Golgi-derived 
transport vesicles carry PtdSer to the plasma-membrane cytoplasmic face. 
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in coordination with a P,-ATPase-catalysed redistribution of ami- 
nophospholipids to the opposite leaflet, presumably to maximize 
impermeability of the plasma membrane (discussed later). 


System failure and disease 

Numerous links between lipid imbalances and human pathologies 
underscore the importance of membrane lipid homeostasis'**. We 
focus on two examples that highlight the physical principles of lipid 
organization in early and late membrane territories in the context of 
liver disease. 

A surplus of free fatty acids contributes to liver failure in obesity and 
type 2 diabetes®**. Although fatty-acid-induced liver disease is gener- 
ally attributed to triglyceride accumulation in liver cells, recent data 
indicate that triglyceride accumulation might also have a protective role. 
The high toxicity of saturated fatty acids is partly due to alimited capac- 
ity of the liver cells to incorporate them into triglycerides**. Increased 
membrane saturation triggers ER stress (see earlier) and can profoundly 
disturb cellular homeostasis by affecting the function of membrane 
channels and transporters such as an ER-resident Ca**-pump, sarcoplas- 
mic/endoplasmic reticulum calcium ATPase (SERCA)*”**. Inhibition 
of SERCA impairs protein folding in the ER and can trigger UPR. This, 
in turn, enhances SREBP1 expression and stimulates fatty acid syn- 
thesis”, creating a positive feedback loop that might further aggravate 
the metabolic derangements and liver injuries caused by saturated fatty 
acids. Thus, perturbations in ER lipid packing density and homeostasis 
can have far-reaching pathophysiological consequences. Further insight 
into the underlying molecular principles might reveal new opportuni- 
ties for therapeutic interventions in common chronic diseases such as 
obesity and insulin resistance. 

Alterations in the lipid packing density of late secretory orga- 
nelles can have equally disastrous consequences. Progressive familial 
intrahepatic cholestasis-1 (PFIC1) is a potentially lethal liver dis- 
ease caused by mutations in ATP8B1 and characterized by a bile-salt 
secretion defect. ATP8B1 codes for a P,-ATPase that is thought to 
translocate PtdSer towards the cytoplasmic leaflet of the canalicular 
membrane of liver cells”. This membrane also harbours the bile salt 
export pump ABCB11. Interruption of the enterohepatic circulation 
of bile salts in patients with PFIC1 results in normalization of their 
hepatobiliary output, indicating that the bile-salt transport defect 
in PFIC1 is not a direct consequence of ATP8B1 dysfunction”. The 
enhanced recovery of PtdSer in bile from ATP8B1 mutant mice sup- 
ports a role for ATP8B1 as a PtdSer translocase”’. These mice also 
have a drastic increase in biliary output of canalicular cholesterol. 
Subsequent studies showed that the activity of the bile-salt pump 
is crucially dependent on the cholesterol content of the canalicu- 
lar membrane”. The exoplasmic leaflet of this membrane is rich in 
sphingolipids that are tightly packed with cholesterol to provide max- 
imal resistance against the detergent action of hydrophobic bile salts. 
By flipping excess PtdSer from the exoplasmic surface towards the 
cytoplasmic leaflet, ATP8B1 helps to preserve this barrier function. 
Loss of ATP8B1 function results in lipid scrambling, thereby reducing 
the lipid ordering in the exoplasmic leaflet and increasing its sensi- 
tivity to hydrophobic bile salts. Increased cholesterol extraction by 
bile salts reduces the cholesterol content of the bilayer, which in turn 
impairs the activity of the bile salt pump and causes cholestasis”. 


Outlook 

We have described the architecture of the secretory pathway in 
terms of contrasting membrane territories that are specialized for 
biogenic and barrier functions, and presented a homeostatic model 
in which lipid composition sensors and pipelines defend these terri- 
tories against erosion by ongoing vesicular trafficking. Surprisingly, 
some recently identified lipid composition sensors have turned out 
to be dual function proteins, with their second function mediating 
seemingly unrelated processes such as protein quality control**** and 
vesicular trafficking”. This reinforces the concept that membrane 
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Figure 5 | Crosstalk between plasma membrane sphingolipid levels 

and transbilayer lipid asymmetry. Low sphingolipid levels in the plasma 
membrane of yeast are sensed by the TORC2 protein kinase complex, which 
phosphorylates, and thus activates, the kinases Ypk1 and Ypk2. Ypk1/2- 
mediated phosphorylation events regulate sphingolipid synthesis and 

flippase activity. Synthesis of sphingolipid precursors is initiated by serine 
palmitoyltransferase (SPT; Lcb1 and Lcb2 in yeast), activity of which is inhibited 
in the SPOTS complex, which contains Orm1 and Orm2. Phosphorylation of 
Orm proteins by Ypk1/2 relieves SPT inhibition. Flippase activity is regulated by 
Fpk1- and Fpk2-mediated phosphorylation. Fpk1 and Fpk2 are downregulated 
on phosphorylation by Ypk1/2, resulting in a decrease in flippase activity. 
Although the SPOTS complex is conserved from mammals to yeast, Orm 
proteins in mammals lack the phosphorylation motifs involved in homeostatic 
regulation of sphingolipids. The Sacl phosphatidylinositol 4-phosphate 
(PtdIns(4)P) phosphatase negatively regulates sphingolipid biosynthesis from 
ceramide, thus providing an additional level of homeostatic control. ER, 
endoplasmic reticulum; PM, plasma membrane. 


lipid homeostasis is integral to a wide range of cellular processes. 
Because the dual roles of these proteins were not predicted, it seems 
likely that other examples of ‘moonlighting’ lipid composition sen- 
sors will be discovered. 

Lipid pipelines coordinate lipid supply and demand between the 
territories of the early and late secretory pathway. The LTPs that 
operate these pipelines belong to large and poorly characterized pro- 
tein families*’, and occasionally are found to shuttle lipids other than 
those anticipated from their annotation™. Distinct from their role in 
transporting lipids, some LTP family members have been proposed 
to act as sensors of membrane lipid composition”’. The ability of an 
LTP to extract lipid from a membrane could influence the nature 
of its interaction with downstream effectors, hence providing an 
indicator of membrane lipid status. Another interesting feature of 
some LTP family members is that they possess functional domains 
other than those required for lipid transfer. For example, the CERT- 
related proteins DLC1, DLC2 and DLC3 contain RhoGAP domains 
implicated in controlling cell migration and carcinogenesis”. The 
link between such dual functions is enigmatic and warrants further 
investigation. 

There are numerous proposed intracellular lipid trafficking steps 
that are known to be non-vesicular, but for which LTPs have not yet 
been identified*’. A prominent example of an ‘unassigned’ pipeline 
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is the one that moves sterols between the ER and plasma membrane. 
Although the reciprocal exchange of sterols and PtdIns(4)P by OSBP 
or Osh4 would result in the unidirectional movement of sterols from 
the ER to the trans-Golgi®”’, this mechanism cannot fully account 
for the equilibratory exchange of sterols between the ER and plasma 
membrane that is revealed by pulse-chase experiments*””*. Intrigu- 
ingly, conditional elimination of the entire OSBP protein family in 
yeast has almost no impact on the exchange of sterols between the 
ER and plasma membrane”. Thus, the LTPs that operate the sterol 
pipeline between the ER and plasma membrane remain to be identi- 
fied. The same holds for the lipid pipelines that run between the ER 
and non-secretory organelles, such as mitochondria, chloroplasts, 
lipid droplets and peroxisomes. 

Further development of the concepts presented in this Review 
will require new assays to quantify lipid movement within cells. 
Although excellent in vitro studies have defined the main mecha- 
nistic steps of lipid transfer in quantitative detail, live cell tracking 
of lipids is fraught with complications. This certainly remains a key 
challenge for the future. m 
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Sphingolipid metabolites in 
inflammatory disease 


Michael Maceyka! & Sarah Spiegel’ 


Sphingolipids are ubiquitous building blocks of eukaryotic cell membranes. Progress in our understanding of sphingolipid 
metabolism, state-of-the-art sphingolipidomic approaches and animal models have generated a large body of evidence 
demonstrating that sphingolipid metabolites, particularly ceramide and sphingosine-1-phosphate, are signalling mol- 
ecules that regulate a diverse range of cellular processes that are important in immunity, inflammation and inflammatory 
disorders. Recent insights into the molecular mechanisms of action of sphingolipid metabolites and new perspectives on 
their roles in regulating chronic inflammation have been reported. The knowledge gained in this emerging field will aid 
in the development of new therapeutic options for inflammatory disorders. 


named the brain lipid ‘sphingosine’ after the Sphinx, owing to 

its enigmatic chemical nature’. Sphingosine and its relatives 
continue to surprise and confound us today. These fatty amino 
alcohols are the backbone of a ubiquitous class of eukaryotic lipids, 
the sphingolipids, which include ceramide (N-acyl-sphingosine), 
sphingomyelin, and hundreds of different glycosphingolipids 
(Fig. 1a). Sphingolipids, like glycerolipids, are reservoirs of bioac- 
tive metabolites of profound importance to myriad cell signalling 
and pathological functions. The sphingolipid metabolites, ceramide, 
ceramide-1-phosphate (C1P), and sphingosine-1-phosphate (S1P) 
are emerging as important signalling molecules that regulate cell 
growth, survival, immune cell trafficking, and vascular and epi- 
thelial integrity, and are particularly important in inflammation 
and cancer””. 

The past decade has seen an explosive advancement in the field 
of sphingolipid signalling based on the convergence of several key 
aspects. First, most of the regulatory proteins and enzymes involved 
in sphingolipid metabolism and the receptors for S1P have been 
cloned. This allowed the generation of knockout mice, yielding 
insights into the physiological functions of sphingolipid metabo- 
lites. Second, the advent of advanced mass spectroscopic techniques 
has brought the ‘omics’ revolution to sphingolipids, allowing for the 
simultaneous analysis and quantification of multiple species. Third, 
specific agonists and antagonists of S1P receptors and inhibitors of 
signalling enzymes were developed. The chief development among 
these was the discovery of FTY720 (fingolimod), a sphingosine ana- 
logue that alters immune cell trafficking and is already being used 
in the clinic for the treatment of multiple sclerosis’. 

These are exciting times for the field and research continues 
apace. Several sphingolipid signalling protein structures have been 
solved, allowing for rational drug design. This Review will focus on 
the function of three key bioactive sphingolipids: ceramide, C1P 
and S1P, and their roles in inflammation. Although this is a nor- 
mal physiological response to harmful stimuli such as infection, 
unchecked inflammation can lead to numerous pathophysiological 
states, including oedema, asthma, inflammatory bowel disease and 
associated cancer, and autoimmune disorders such as multiple scle- 
rosis and rheumatoid arthritis. Sphingolipid metabolites play cru- 
cial parts at multiple stages of these disorders, and new mechanistic 


I n the 1880s, the neurochemist J. L. W. Thudichum presciently 


perspectives on their actions will be discussed. We will also highlight 
how knowledge gained in this relatively new field will aid in the 
development of therapeutic options for inflammatory disorders. 


Sphingolipid metabolism 

Sphingolipids are essential lipids consisting ofa sphingoid backbone that is 
N-acylated with various fatty acids to form many ceramide species, which 
can have hundreds of distinct head groups. As this Review is mainly con- 
cerned with sphingolipid signalling and inflammatory diseases, we will 
focus on mammalian sphingolipids and their metabolism. Sphingolipid 
synthesis starts in the endoplasmic reticulum (ER) with the condensation 
of serine and palmitoyl coenzyme A (CoA) by serine palmitoyltransferase 
(SPT), the rate-limiting step, forming 3-ketosphinganine (Fig. 1b). SPT 
mutations can shift substrate preference to alanine, producing neuro- 
toxic deoxysphingolipids that cause hereditary sensory and autonomic 
neuropathy type 1 (ref. 5). SPT activity is negatively regulated by ORMDL 
proteins’, and a genetic variant that increases ORMDL3 expression has 
been linked to increased risk of asthma’. Reduction of 3-ketosphinganine 
forms dihydrosphingosine that is then N-acylated by one of six ceramide 
synthases (CerS1—CerS6), each using specific acyl chains, typically with 
saturated or mono-unsaturated fatty acids with 14 to 26 carbons, forming 
dihydroceramides that are subsequently dehydrogenated to ceramides by 
dihydroceramide desaturase. Metabolism of ceramide to sphingomyelin 
and a vast array of complex glycosphingolipids mainly occurs in the Golgi 
(Fig. 2). ER to Golgi transport of ceramide is mediated by ceramide trans- 
fer protein (CERT) for sphingomyelin synthesis®, or by vesicular transport 
for glucosylceramide synthesis. Ceramides can also be phosphorylated 
in the Golgi by ceramide kinase (CERK) to form C1P, a rare species. 
Glycosphingolipid synthesis requires the transfer of glucosylceramide 
to the trans-Golgi network (TGN) by four-phosphate adaptor protein 2 
(FAPP2), which also regulates vesicular trafficking from the Golgi to the 
plasma membrane’. 

Sphingolipids have a rapid turnover and their levels are controlled by 
the balance between synthesis and degradation in multiple compart- 
ments”. Sphingolipids are degraded in lysosomes by glycosidases or acid 
sphingomyelinases that remove the head groups to form ceramides. 
Deacylation of ceramide by ceramidases is the only pathway known to 
generate sphingosine. Sphingosine can be recycled back to ceramide, and 
50% or more of sphingosine molecules follow this reutilization pathway, 
which has a significant role in sphingolipid homeostasis”. Sphingosine 
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can also be phosphorylated by one of two sphingosine kinases, SphK1 and 
SphK2, forming S1P, a pleiotropic bioactive metabolite and key intermedi- 
ate in the sphingolipid-to-glycerolipid metabolic pathway”. 

S1P can then either be dephosphorylated by phosphatases, includ- 
ing two S1P-specific, ER-localized phosphatases (SPP1 and SPP2), or 
irreversibly degraded by S1P lyase (SPL) to phosphoethanolamine and 
hexadecenal that can be incorporated into glycerolipids. The fatty alde- 
hyde dehydrogenase ALDH3A2 converts hexadecenal to hexadecenoate, 
which is utilized for the formation of palmitoyl-CoA in glycerolipid syn- 
thesis, revealing the connection between sphingolipid and glycerolipid 
homeostasis’. ALDH3A2 is non-functional in Sjégren—Larsson syn- 
drome, a disease characterized by ichthyosis and intellectual disability, 
suggesting that the accumulation of S1P degradation products contributes 
to the pathogenesis”. 


The SIP gradient 

S1P concentrations are high in blood and lymph, and low in tissues, 
presumably due to the higher activity of S1P degrading enzymes in 
tissue. The high level of blood S1P has important homeostatic func- 
tions in the maintenance of vascular integrity, and the S1P gradient 
is crucial for immune cell trafficking. S1P in blood is rapidly turned 
over”, suggesting robust mechanisms for its synthesis inside cells and 
its transport outside. Both SphK1 and SphK2 contribute to circulating 
pools of S1P. Erythrocytes” and vascular endothelial cells'* are the 
main sources of blood S1P, and lymphatic endothelial cells are the 
main sources for lymph $1P”’. S1P is exported out of vascular and 
lymphatic endothelial cells by the specific transporter Spns2, which 
is not present in platelets or erythrocytes, suggesting the existence of 
other transporters in these cells'*’”. Although ceramide, which is also 
abundant in plasma, is associated with very-low-density lipoprotein 
and low-density lipoprotein, $1P in plasma is bound to both albumin 
and apolipoprotein M (apoM), which preferentially associates with 
high-density lipoprotein. However, only high-density lipoprotein 
containing apoM-S1P is required to preserve endothelial barrier 
integrity’. ApoM produced by the liver influences plasma S1P levels 
by enhancing S1P biosynthesis for secretion by hepatocytes, dem- 
onstrating that the liver is involved in S1P dynamics and that apoM 
delivers $1P to extrahepatic tissues””’. 


Sphingolipid signalling 

Multiple stimuli affect sphingolipid biosynthesis and metabolism by 
regulating key enzymes in a spatial and temporal manner, leading 
to the restricted production of ceramide, C1P and S1P?. For exam- 
ple, pathogens, oxidative stress and cytokines activate acid or neutral 
sphingomyelinases to generate ceramide in specific compartments’. 
Sphingosine generated by ceramidase is another sphingolipid metab- 
olite involved in cell signalling” (see Review by Platt on page 68). 
However, its role in inflammatory responses is not yet clear". Some 
inflammatory mediators also activate SphK1 by extracellular signal 
regulated kinase (ERK)-mediated phosphorylation, which promotes 
SphK1’s translocation to the plasma membrane, at which its substrate, 
sphingosine, is localized and/or generated, resulting in transient eleva- 
tions in $1P levels*’’. Although these studies were mainly carried out 
with cultured cells, the development of high-sensitivity mass spec- 
trometry methods has led to demonstrations of changes in levels of 
bioactive sphingolipids being reported in specific tissues from mouse 
inflammatory disease models” and, in a few cases, in human sam- 
ples*’*’. More studies are needed to rigorously correlate bioactive 
lipids with disease stage. Understanding what bioactive sphingolip- 
ids do and how they signal is crucial for our understanding of their 
functions in inflammatory responses. 


Ceramide 

Despite the involvement of ceramide in numerous biological pro- 
cesses, only a few of its direct targets have been described. The 
best examples are ceramide-activated serine/threonine protein 
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Figure 1 | Sphingolipid metabolism and interconnected bioactive 
sphingolipid metabolites. a, Structure of ceramide, with a sphingosine 
backbone. b, De novo sphingolipid biosynthesis starts with the condensation 
of palmitoyl coenzyme A (CoA) and serine by serine palmitoyltransferase 
(SPT), an enzyme that is negatively regulated by ORM1-like protein 3 
(ORMDL3). This is followed by a series of reactions (catalysed by enzymes 
in red) leading to formation of ceramide and subsequent formation of 
sphingomyelin and glycosphingolipids. Ceramide can be metabolized to other 
bioactive sphingolipid species, phosphorylated by ceramide kinase (CERK) 
to ceramide-1-phosphate (C1P), or hydrolysed to sphingosine, which is then 
phosphorylated to sphingosine-1-phosphate (S1P) by sphingosine kinases 
(SphKs). $1P can be degraded by phosphatases to sphingosine or by the 
lyase (SPL) that cleaves it to phosphoethanolamine and hexadecenal, which 
are subsequently reincorporated into glycerolipid metabolic pathways. For 
simplicity, degradative enzymes (blue) for reutilization of sphingolipids in 
the salvage pathway are included but these reactions take place in different 
subcellular compartments (see Fig. 2). CDase, ceramidase; CerS, ceramide 
synthase; GCase, glucosylceramidase; GCS, glucosylceramide synthase; Pase, 
phosphatase; PtdEtn, phosphatidylethanolamine; SMase, sphingomyelinase; 
SMS, sphingomyelin synthase; SPPase, sphingosine phosphate phosphatase. 


Sphingomyelin 


phosphatases, including PP1, PP2A and PP2C*’, PP2A is found 
in complex with SET (also known as I2PP2A), which inhibits PP2A 
function. Ceramide binds to SET, relieving its inhibitory actions”. 
Activation of PP2A that leads to dephosphorylation of Akt, a potent 
promoter of cell survival, could partly explain the pro-apoptotic 
actions of ceramide. In additional, PP1 could be the target of cera- 
mide generated at the plasma membrane”. Stimulation of the atypi- 
cal protein kinase PKCC by ceramide, probably concomitant with its 
membrane recruitment, also leads to inhibition of Akt™. 

Another concept that has been subject to renewed interest is the for- 
mation of ceramide-enriched membrane microdomains or platforms 
as a mechanism by which ceramide transduces intracellular signalling. 
Alterations in membrane domains, owing to the elevation of ceramide 
and its tendency to self-associate, influence membrane composition 
and interactions of lipids or signalling proteins. These ceramide-rich 
platforms have been implicated in a variety of signalling cascades in 
immune cells, including activation of B cells, bacterial pathogen infec- 
tions and release of cytokines during infection; they are also especially 
important in the induction of apoptosis”. 
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Figure 2 | Subcellular compartmentalization of sphingolipid 
metabolism. De novo ceramide (Cer) synthesis takes place in the 
endoplasmic reticulum (ER). Cer is delivered by ceramide transport 
protein (CERT) or vesicular transport to the Golgi for synthesis of 
ceramide-1-phosphate (C1P) (by ceramide kinase, CERK), sphingomyelin 
(SM), and glucosylceramide (GluCer). Four-phosphate adaptor protein 

2 (FAPP2) then transports GluCer to the trans-Golgi for biosynthesis 

of complex glycosphingolipids (GSLs). SM and GSLs are delivered to 

the plasma membrane by vesicular transport and C1P by a C1P-specific 
transfer protein (CPTP). For signalling at the plasma membrane, 
sphingomyelinase (SMase), ceramidase (CDase), and sphingosine kinases 
(SphK) produce the bioactive metabolites Cer, sphingosine (Sph) and 
sphingosine-1-phosphate (S1P), respectively. S1P is then transported 
across the membrane. Membrane sphingolipids are internalized by the 
endocytic pathway and in the lysosome they are degraded by acidic 

forms of SMase, glycosidase (GCase) and CDase. The Sph formed can be 
metabolized to glycerolipids after phosphorylation by SphKs (probably 
SphK1) and cleavage by S1P lyase (SPL) or reutilized for sphingolipid 
synthesis in the salvage pathway. In the nucleus, SphK2-produced S1P 
inhibits histone deacetylases. 


CIP 

C1P is formed on the TGN by CERK. Here, C1P activates cytosolic 
phospholipase-A2a (cPLA2a)**, the enzyme that releases the eicosa- 
noid precursor arachidonate. Although early concerns regarding 
cPLA2q activation by C1P were raised”, others have shown that not 
only does C1P bind cPLA2a through an RxRH motif in its amino- 
terminal CaLB (also known as C2 lipid-binding domain) and recruit 
it to the membrane at which its phospholipid substrates reside, C1P 
also stimulates its activity and therefore has been implicated in eicos- 
anoid production****. Conversely, C1P has been shown to inhibit 
ADAM 17 (also known as tumour necrosis factor (TNF)-convert- 
ing enzyme, or TACE), the major metalloprotease responsible for 
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cleaving pro-TNF to release the active inflammatory form, probably 
by binding to one or more similar tribasic motifs”. 


SIP 

The many diverse roles of S1P in innate and adaptive immunity, includ- 
ing immunosurveillance, immune cell trafficking and differentiation, 
immune responses and endothelial barrier integrity are mediated by 
its binding to one of five G-protein-coupled receptors, named $1PR1- 
S1PRS (refs 3, 40, 41). Downstream signalling of these receptors is com- 
plexas they are differentially expressed in immune and endothelial cells 
and couple to a varied set of heterotrimeric G proteins. For example, acti- 
vation of S1PR1 generally promotes cell migration and the egress of T and 
Blymphocytes from lymphoid tissues”, whereas $1PR2 inhibits motility 
to promote retention of B cells in germinal centres’. S1P produced inside 
cells by activation of SphK1 can be secreted by SPNS2 or promiscuous 
ABC transporters”. S1P, in turn, signals through its receptors ina par- 
acrine and/or autocrine manner, termed ‘inside-out’ signalling. However, 
the crystal structure of S1PR1 suggests that $1P must slide laterally within 
the plane of the bilayer to access the binding pocket”, raising the question 
of whether S1P needs to be secreted for its autocrine actions. 

Much less is known about the intracellular targets of S1P. Within the 
past decade, new intracellular targets of S1P have been characterized. 
S1P formed by SphK1 in response to TNF or interleukin-1 (IL-1) binds 
to TNF receptor-associated factor 2 (TRAF2) and cellular inhibitor of 
apoptosis 2 (cIAP2), respectively, and enhances their lysine-63-linked 
polyubiquitylation activities“*””. In response to IL-1, which has a pivotal 
role in autoinflammatory diseases, cC[AP2 and SphK1 form a complex 
with interferon regulatory factor-1 (IRF1), leading to its polyubiqui- 
tylation and activation. Consequently, IRF1 enhances expression of the 
chemokines CXCL10 and CCLS5 that recruit mononuclear cells to sites 
of sterile inflammation”. Thus, in addition to canonical $1P receptor 
signalling, these new intracellular functions of $1P might be involved 
in the immune and inflammatory responses of these potent cytokines. 
Furthermore, S1P in the nucleus, formed by SphK2 or by inhibition of 
SPL, binds and inhibits the histone deacetylases HDAC1 and HDAC2, 
linking sphingolipid metabolism to inflammatory and metabolic gene 
expression” ~’. Further studies are needed to solidify these new concepts. 


Sphingolipids in endothelial biology and function 

The vascular endothelium forms a barrier that lines blood vessels 
and plays an important part in cardiovascular homeostasis and blood 
flow, and in controlling the passage of leukocytes into and out of the 
bloodstream. Ceramide and S1P have opposing effects on vascular 
endothelium functions: whereas increased ceramide leads to endothe- 
lial barrier dysfunction”, S1P is crucial for maintenance of vascular 
integrity” ™* (Fig. 3a). 

Pro-inflammatory cytokines, oxidative stress and injurious insults, such 
as platelet activating factor (PAF), TNF and lipopolysaccharide, generate 
excess ceramide in endothelial cells linked to increased vascular perme- 
ability and dysfunction (Fig. 3a). Generation of ceramide is both stimu- 
lus- and context-specific, and is due to the activation of acid or neutral 
sphingomyelinase, or increased de novo biosynthesis. Ceramide triggers 
several pathways that induce endothelial cell death, including activation 
of caspases, or PP1 or PP2A”°, and increasing mitochondrial perme- 
ability by forming ceramide-enriched platforms capable of translocating 
proteins. Moreover, PAF-induced formation of ceramide microdomains 
drives endothelial nitric oxide synthase (eNOS) activation and contrib- 
utes to barrier dysfunction”. Ceramides have also been linked to growth 
arrest, cytoskeleton rearrangements, oxidative stress and senescence of 
endothelial cells”. Thus, ceramides regulate important endothelial cell 
functions that are thought to be responsible for the pathogenesis associ- 
ated with vascular dysfunctions, including emphysema, sepsis and acute 
respiratory distress syndrome. 

Using animal models of acute and chronic inflammation, it has 
been convincingly demonstrated that plasma S1P limits disruption 
of vascular endothelial monolayers and reduces oedema” (Fig. 3a). 
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Figure 3 | S1P and S1PR1 control lymphocyte trafficking and vascular 
integrity. a, The sphingosine-1-phosphate (S1P) level in lymphoid tissues, 
such as the thymus, is low compared with the blood, forming an S1P gradient 
(shaded red) that attracts lymphocytes and promotes S1PR1-dependent egress 
into the blood. In the blood, S1P is bound to albumin (Alb) and apolipoprotein 
M (M) in high-density lipoprotein (HDL), and signalling by S1PR1 maintains 
endothelium barrier function by promoting cell-cell interactions (right). 
Conversely, ceramide (Cer) can promote barrier leakage. b, When T cells are 


S1P activates endothelial S1PR1, leading to enhanced Rac-dependent 
cytoskeleton rearrangements, contacts between cells and the matrix, 
adherens junction assembly and barrier integrity*”’. 

Lymphocytes circulate through lymph nodes for immune surveil- 
lance, entering at high endothelial venules (HEVs) — specialized blood 
vessels. Until recently, it was not known how HEVs allow lympho- 
cyte transmigration, which increases during immune responses, and 
maintain vascular integrity. A study demonstrated that podoplanin 
expressed on HEV fibroblastic reticular cells binds and activates plate- 
let C-type lectin-like receptor-2 (CLEC2)”*. Activation of CLEC2 on 
extravasated platelets leads to the release of S1P in the perivenular 
space of HEVs. S1P, in turn, enhances vascular endothelial (VE)- 
cadherin expression for maintenance of the integrity of HEVs during 
immune responses™. It is likely that the $1P-S1PR1 axis is also involved 
in regulating the integrity of HEVs. 

During inflammation, infiltrating immune cells and the production 
of pro-inflammatory cytokines increase endothelial permeability. $1P 
signalling attenuates the increased permeability induced by the inflam- 
matory mediators histamine and PAF”. Itis still not clear whether levels of 
S1P in plasma are reduced in systemic inflammation or sepsis. However, 
because levels of the S1P carrier protein apoM are decreased in patients 
with sepsis and systemic inflammatory response syndrome”, itis possible 
that S1P plasma levels are reduced in these conditions and thus contribute 
to vascular leakage. 

Although S1PR2 does not regulate basal permeability, it protects 
against acute vascular barrier disruption and lethality after antigen 
challenge or PAF administration”. Endothelial $1PR2 inhibits Akt 
activation, suppressing eNOS and NO generation, and protects against 
the disassembly of adherens junctions by decreasing S-nitrosylation of 
B-catenin and thereby vascular leakage*"’. By contrast, during endotox- 
aemia, endothelial $1PR2 enhances vascular permeability and expres- 
sion of pro-inflammatory and pro-coagulation molecules by activating 
Rho and p38 SAPK signalling pathways”. Together, these observations 
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ready to exit the thymus and enter the blood, $1PR1 is re-expressed so that 
T cells can respond to the chemotactic effect of high S1P levels in circulation. 
When T cells re-enter the blood, $1P downregulates $1PR1. Signals that control 
lymphocyte entry through high endothelial venules and retention in lymph 
nodes are not shown. Lymph node egress into lymphatic vessels also requires 
S1PR1. Increased lymphoid tissue $1P, by SPL inhibition, or in the presence of 
S1PRI1 modulators such as FTY720, blocks egress of lymphocytes by disruption 
of the S1P gradient or desensitization of S1PR1 on T cells, respectively. 


suggest that S1PR1 and S1PR2 have specific roles in preventing or trig- 
gering vascular leakage, probably by activation of distinct signalling 
pathways. For example, SIPR1 mediates G,-dependent Rac and Akt 
activation, whereas $1PR2 inhibits them and stimulates Rho*™’. Alter- 
natively, the S1P receptors might be temporally and spatially localized 
and upregulated during inflammation. 

An unexpected function of endothelial S1PR1 was discovered 
using S1PR1-specific agonists that suppressed the cytokine storm and 
recruitment of innate immune cells to the lung during influenza infec- 
tion”. The efficacy of S1PR1 agonists was a result of the inhibition of 
common pathways downstream of multiple innate sensing pathways 
that are important for interferon production™. A better understanding 
of how S1P receptors and their downstream signalling are regulated 
and how they affect the vascular system could provide the basis for 
developing improved therapies for vascular disorders. 


Sphingolipids and inflammation 

During inflammation, innate and adaptive immune cells enter sites of 
infection or injury and the activation of cytokine networks that follows 
helps to protect the host. Sphingolipid metabolites have key roles in the 
regulation of both trafficking and functions of immune cells’*™, and there 
are indications that sphingolipid metabolism and S1P receptors might be 
altered by inflammation. 

S1P drives the differentiation of many types of immune cells, 
inducing changes in their functional phenotypes and regulating 
production of pro-inflammatory cytokines and eicosanoids. These 
topics will not be covered here as they have recently been extensively 
discussed (see refs 3, 42, 64 and 65 for authoritative reviews). In par- 
ticular, S1P has emerged as a central regulator of lymphocyte egress. 
Although the S$1P-S1PR1 axis regulates trafficking and migration of 
most types of immune cells (such as T and B lymphocytes, haema- 
topoietic progenitors, macrophages, dendritic cells, neutrophils, 
mast cells, natural killer T cells and osteoclasts*”), the regulation 
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Figure 4 | Ceramide at the nexus of obesity and inflammation. Ceramide is 
generated in response to obesity signals (such as saturated fatty acids, satFA), 
lipopolysaccharide (LPS) or pro-inflammatory cytokines (such as TNF) by 
enhancement of de novo biosynthesis or sphingolipid recycling. Ceramide 

can then signal to a variety of pathways that promote inflammation and the 
deleterious effects of obesity. Ceramide can bind to and inhibit SET/I2PP2A 
(not shown), leading to the activation of PP2A, which dephosphorylates and 
inhibits Akt, blocking many of the actions of insulin signalling and leading to 
insulin resistance, and decreased cell survival; activate PKC(, also leading to 
the inhibition of Akt; induce autophagy, which can influence innate immune 
responses; activate the Nlrp3 inflammasome to promote the processing and 
secretion of the pro-inflammatory cytokines IL-18 and IL-18; and induce 
apoptosis. Binding of adiponectin to its receptors stimulates ceramidase 
(CDase), degrading ceramide to sphingosine, which is then phosphorylated to 
sphingosine-1-phosphate (S1P). Insulin stimulates Akt. It should be noted that 
this diagram is not meant to imply that all of these events occur in the same 
cell. CerS, ceramide synthase. 


of T- and B-lymphocyte trafficking is the best understood and will 
be used to illustrate the primary mechanisms involved. 


SIP and lymphocyte trafficking 

S1P concentrations in blood and lymph are much greater than in 
tissues, and lymphocyte egress from the thymus or secondary lym- 
phoid tissues is dependent on the cells sensing this gradient through 
activating S1PR1 (ref. 42). When ready to leave the thymus, T lym- 
phocytes upregulate S1PR1 (ref. 66) and exit in response to the S1P 
gradient (Fig. 3b). S1PR1-dependent Rac activation is required for 
trans-endothelial migration of T cells and lymph node egress”. After 
entering the blood, high S1P levels desensitize S1PR1, owing to phos- 
phorylation by guanine nucleotide-binding protein coupled receptor 
kinase-2 (GRK2)®. Coordinated action of selectins, integrins and 
especially the retention signals of the CCL21 chemokine receptor 
CCR7 are required for lymphocytes to enter the lymph nodes through 
HEVs™. T lymphocytes sample the environment randomly — corti- 
cal sinus probing is followed by $1PR1-dependent entry” and then 
flow into the efferent lymph containing high $1P"* (Fig. 4b). Dur- 
ing inflammation, lymphocyte retention is increased by the loss of 
S1PR1 expression, its internalization and degradation. After several 
rounds of division, newly generated effector cells upregulate S1PR1, 
lose the CCR7 retention signal and then exit into circulation and 
travel to locations of injury and inflammation. The immune system 
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co-opts the S1P-S1PRI1 axis to determine the establishment of resi- 
dent memory T cells, a distinct lineage that are embedded in non- 
lymphoid tissues and that mediate protective immunity. Regulation 
of the transcription factor KLF2 and its target gene S1pr1 in mice by 
cytokines determines whether CD8" T cells re-enter circulation or 
become tissue resident”. 

In addition to regulating the egress of B cells, the S1P-S1PR1 axis 
plays a part in the positioning of marginal-zone B cells in the spleen”’. 
These B cells are the first line of defence against systemic blood-borne 
antigens. Levels of S1P are high in the splenic marginal zone, causing 
desensitization of S1PR1 on resident B cells that allows them to migrate 
into the follicles in response to the follicular-dendritic-cell-produced 
chemokine CXC13 (ref. 42). B cells in the marginal zone shuttle con- 
stantly between this zone and follicles, depending on the S1P gradient. 
This allows for the rapid delivery of antigens from the blood to the fol- 
licles”. However, in germinal centres in which mature B cells proliferate, 
differentiate and switch antibody class production, $1PR2 activation 
and its downstream effectors Ga12, Gal3 and p115RhoGEF counter- 
act signalling by Akt, decreasing survival and responses to follicular 
chemoattractants, thus helping to focus the cells to the follicle centre”. 
It remains to be seen whether this gradient of S1P exists between the 
germinal centre and the follicle boundary. The remaining major chal- 
lenges are to understand how this gradient is produced and how to 
measure S1P pools in local environments. For S1P to be sensed during 
egress, its levels in tissue interstitial fluid would be expected to be very 
low to ensure that B-cell S1PR1 could respond to higher concentrations 
at the luminal surfaces of egress sites. However, although interstitial 
levels are generally assumed to be due to high S1P degrading activity 
in tissues, so far it has not been possible to directly measure interstitial 
fluid S1P levels. It is anticipated that this will be accomplished soon with 
the improved sensitivity of newer mass spectrometers. 


SIP could be pro- or anti-inflammatory 

The inflammatory role of S1P formed in immune cells by SphK1 or SphK2 
is still unclear. This conundrum began when inflammatory responses 
were examined in Sphk1 and Sphk2 knockout mice. Whereas some studies 
have reported that colonic inflammation” and synovial inflammation in 
TNF-induced arthritis” are reduced in Sphk1 knockout mice, others have 
reported that these mice have normal acute and chronic inflammatory 
responses”’. The retraction of a paper that suggested SphK1 hada role 
in regulating pro-inflammatory responses in a mouse sepsis model was 
concerning”, and there is no evidence that inhibiting SphK 1 or deleting 
its gene will reduce sepsis. Although Sphk2 knockout mice have normal 
neutrophil functions, in a model of bacterial lung infection these mice 
have accelerated disease progression”’. Similarly, neuroinflammation”® 
and lung inflammatory injury” induced by lipopolysaccharide were 
increased by deletion of Sphk1. 

It is possible that S1P formed by SphK1 or SphK2 has distinct func- 
tions and responses to stimuli in specific types of cells and tissues. This 
could also depend on the expression of S1P receptors and their specific 
downstream targets. Also, whereas deletion of Sphk1 decreases S1P levels 
in blood, deletion of Sphk2 increases it”””’, probably due to upregulation 
of Sphk1 expression and an increase of S1P in some tissues but not oth- 
ers’*. Changes in levels of $1P in the circulation and in tissues could affect 
immune-cell differentiation and functional phenotypes™. Production of 
S1P is spatially and temporally regulated and its physiological and inflam- 
matory outcomes are context-, tissue- and source-dependent. Future 
studies with conditional knockout mice could clarify the inflammatory 
roles of S1P and the kinases that produce it in specific tissues. 


Sphingolipids may mediate inflammatory signalling by TNF 

TNE akey pro-inflammatory cytokine and therapeutic target in inflam- 
matory diseases, stimulates sphingolipid metabolic enzymes includ- 
ing sphingomyelinases, ceramidase and SphK1 (refs 2, 79). Several 
sphingolipid metabolites have been suggested to be involved in TNF 
signalling and chemokine production based on studies of reduction or 


© 2014 Macmillan Publishers Limited. All rights reserved 


overexpression of these enzymes. For example, cells that are deficient in 
acid sphingomyelinase or acid ceramidase have decreased production 
of CCL5 induced by TNE, possibly due to regulation of sphingosine lev- 
els”, whereas knockdown of the gene encoding neutral sphingomyeli- 
nase regulates adhesion molecules independently of NF-«B activation”. 
The effects of deletion or knockdown of SphK1 have yielded conflicting 
results. Initially, overexpression of a dominant-negative SphK1 mutant” 
or SphK1 downregulation*"’ reduced NF-xB activation, production of 
cytokines and chemokines, and induction of COX2 and PGE2 produc- 
tion in response to TNE. More recently, S1P formed by TNF-mediated 
activation of SphK]1 was suggested to be crucial for TRAF2-mediated K63 
polyubiquitylation of RIP1, a key step in NF-«B activation*®. However, 
studies have now found that downregulation of SphK1 had no effect on 
TNF-induced NF-«B activation but enhanced CCLS expression through 
the p38 MAPK pathway, whereas downregulation of SphK2 reduced 
CCLS5 expression without affecting NE-«B*’. Moreover, TNF-mediated 
NF-«B activation and cytokine expression in murine macrophages that 
lack both SphK1 and SphK2 were not altered but these cells had increased 
sphingosine and ceramide levels and autophagy™. Clearly, resolving these 
discrepancies requires further study; however, there are several intriguing 
possibilities. The interconversion of sphingolipid metabolites is rapid, 
and fluxes of sphingolipids depend on the cell type; thus, distinct bioac- 
tive sphingolipids could be increased by TNF and their targets could be 
cell-type dependent. The outcome of inside-out signalling by $1P induced 
by TNE, as observed in certain cell types’, would depend on the expres- 
sion of S1P receptors and their downstream effectors. It is also possible 
that deletion of sphingosine kinases induces compensatory mechanisms. 
Moreover, TNF-induced inflammatory responses are not only regulated 
by NF-«B but are also controlled by crosstalk between NF-«B with other 
context-dependent pathways. Finally, TNF signalling leading to NF-«B 
and cytokine and chemokine production is now appreciated to be more 
complex than was originally suggested”. The recent discovery of RIP1- 
dependent and -independent activation of the early and late phases of IkB 
kinase (IKK), which are differentially regulated by low and high doses of 
TNF that lead to the expression of distinct sets of NF-«B target genes™, 
suggests a potential explanation for the divergent findings on the role of 
SphK1 and S1P. Exploring these possibilities might reconcile contradic- 
tory observations and shed new light on the inflammatory roles of S1P. 


Ceramides and inflammation 

During host-pathogen defence, pattern recognition receptors, includ- 
ing Toll-like receptors (TLRs) and Nod-like receptors (NLRs), expressed 
by sentinel cells, macrophages and dendritic cells, bind microbial prod- 
ucts to initiate innate immune responses and prime adaptive immunity. 
Integration of data from sphingolipidomics and transcriptomics of mac- 
rophages treated with lipopolysaccharide, the ligand for TLR4, revealed 
that several enzymes in sphingolipid metabolism and their products, 
particularly ceramide, were coordinately elevated****. Several intrigu- 
ing ideas have been proposed to explain how ceramide links TLR4 and 
other members of this family to inflammation (Fig. 4). In macrophages, 
increased de novo ceramide biosynthesis is required for autophagosome 
formation”, which is thought to have key roles in innate immunity”. A 
question still to be answered is whether changes in autophagy owing to 
ceramide contributes to immunity and/or inflammation. Disruption of 
membrane microdomains in Cers2 knockout mice, which are unable 
to make very-long-chain ceramides, prevented TNF-R1 internalization 
and downstream signalling™. 

Another area that has received much attention is the connection 
between low-grade chronic inflammation and the pathogenesis of obe- 
sity and metabolic dysfunction”. A recently uncovered mechanistic link 
between increased inflammation and obesity is the activation of TLR4 
by saturated fatty acids, leading to transcriptional activation of ceramide 
biosynthetic genes, including those that encode SPT and specific CerS 
isoforms, in an IKK-B-dependent manner”. Interestingly, although 
not required for TLR4-dependent inflammatory cytokine induction, 
elevated ceramide production was required for TLR4-dependent insulin 
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resistance, owing to inhibition of Akt’’, the activation of which is neces- 
sary for stimulated glucose transport” (Fig. 4). Likewise, CerS2 knockout 
mice are glucose intolerant, owing to the inability of the insulin recep- 
tor to translocate to the disrupted membrane platforms, preventing Akt 
activation”. Together, these studies implicate ceramide as a mediator 
linking lipid-induced inflammatory pathways and insulin resistance. 

The NLR family member NLRP3 inflammasome is activated by 
diverse non-microbial danger-associated molecular patterns (DAMPs) 
derived from damaged cells, and induces inflammation by increasing 
IL-16 and IL-18 secretion. In obese mice, lipotoxicity-associated cera- 
mide was shown to increase activated caspase-1 in a Nlrp3 inflamma- 
some-dependent manner in macrophages and adipose tissue”. This 
is the first suggestion that the Nlrp3 inflammasome senses increases 
in intracellular ceramide (Fig. 4)””**. Moreover, age-related increased 
thymic ceramides also leads to Nlrp3 inflammasome-dependent cas- 
pase-1 activation”, suggesting that this form of ceramide elevation is 
responsible for the collapse of the thymic stromal cell microenviron- 
ment and subsequent decreased production of naive T cells and lower 
immune surveillance in the elderly”. 


Ceramides and adipokines 
An intriguing potential link has surfaced between sphingolipid metabo- 
lism and the pleiotropic actions of the adipocyte hormone adiponectin. 
By binding to its receptors AdipoR1 or AdipoR2 adiponectin decreases 
inflammation and increases insulin sensitivity”. AdipoR1- and Adi- 
poR2-stimulated ceramidase degraded ceramide to sphingosine, which 
was converted into the pro-survival S1P; evidence indicates that cer- 
amidase activation initiates many adiponectin actions (Fig. 3)”. It is 
unclear whether AdipoR1 and AdipoR2 have intrinsic ceramidase activ- 
ity, although a recent report suggests that AdipoRI, but not AdipoR2, 
induces caveolin-dependent recruitment of neutral ceramidase”’. 
Subsequent studies demonstrated that the adiponectin-ceramide 
axis is also involved in fibroblast growth factor-21 (FGF21)-mediated 
regulation of carbohydrate and lipid metabolism, and energy utiliza- 
tion. The beneficial effects of FGF21 were abolished in adiponectin- 
null mice, which were resistant to FGF21-induced decreases in ceramide 
and increases in energy expenditure”. Adiponectin also mediated the 
FGF21-induced decrease in production of liver pro-inflammatory 
cytokines in obese, diabetic mice”. Experiments are needed to determine 
which adiponectin-mediated effects depend mainly on decreasing cera- 
mide signalling and which depend on increasing S1P signalling. Interest - 
ingly, thiazolidinediones — agonists for the nuclear receptor PPAR-y, 
widely used for management of type 2 diabetes — also induce FGF21 
and adiponectin, and decrease ceramide in plasma and the liver”’. These 
FGF21 and thiazolidinedione effects bring ceramides into the spotlight 
for research on new therapeutics for obesity-induced inflammation and 
metabolic disorders. 


New insights on the role of C1P in inflammation 

CIP isa relatively new addition to the group of bioactive sphingolipid 
metabolites involved in inflammation. C1P activates cPLA2a*, which 
releases arachidonic acid in the rate-determining step in eicosanoid 
production. As eicosanoids drive the pathogenesis of many inflam- 
matory disorders, understanding the role of C1P in this process is 
crucial. An intriguing example of the latter was the characterization 
of a unique C1P-specific transfer protein called CPTP”™*. Depletion of 
CPTP, which is present in the cytosol and associated with the TGN 
and plasma membrane, increases C1P in the Golgi and nucleus, but 
decreases it in the plasma membrane. This suggested that CPTP con- 
trols levels of C1P produced by CERK in the Golgi by transferring it 
to the plasma membrane, thereby suppressing cPLA2a activity and 
reducing arachidonic acid release and eicosanoid production**’. The 
C1P that CPTP transfers to the plasma membrane is probably rap- 
idly degraded by lipid phosphatases to ceramide. Another intriguing 
observation from this study was that changes in C1P levels directly 
correlated with changes in sphingosine and S1P, raising the possibility 
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that C1P might be a precursor of S1P that is mediated by, yet to be 
characterized, deacylase activity**. Global CERK inactivation has 
only modest effects on inflammatory responses, in contrast to cPLA2 
knockout, and further studies are necessary to address this discrep- 
ancy. Nevertheless, acute downregulation of CERK using short inter- 
fering RNA did reduce eicosanoid production in a mouse model of 
airway hyper-responsiveness”. 

The concept of immunomodulation by C1P has become increas- 
ingly intricate as other studies have shown that C1P also negatively 
regulates inflammatory cytokine production. C1P produced by CERK 
regulates TNF processing and secretion by direct inhibition of TACE”. 
In addition, exogenous C1P was found to suppress lipopolysaccharide- 
mediated production of TNE, IL-6, IL-8 and IL-1 in human peripheral 
blood mononuclear cells”. Although exogenous C1P can stimulate 
macrophage migration by a pertussis-toxin-sensitive GPCR™, a cell 
surface C1P receptor has never been identified. It is also unclear what 
biological effects, if any, are regulated by circulating C1P. 


Therapeutic interventions in inflammatory disorders 
Much of the clinical interest in sphingolipid metabolites and their sig- 
nalling was spurred by the discovery of the mechanism of action of the 
immunosuppressant drug FT Y720 (ref. 102). In vivo, SphK2 is mainly 
responsible for the phosphorylation of FTY720 to FTY720-P, an S1P 
mimetic and agonist for all S1P receptors except S1PR2. Binding of 
FTY720-P to S1PR1 leads to its downregulation, thereby effectively 
acting as a functional S1PR1 antagonist. Down-modulation of S1PR1 
induces the retention of lymphocytes in lymphoid organs, termed lym- 
phopenia, because the receptor is required for their exit into the blood 
up the S1P gradient; this prevents adaptive immune surveillance of tis- 
sues, and thus can ameliorate autoimmune diseases. 


Autoimmune diseases 

Thousands of patients have been treated with FT Y720 for the 
relapsing-remitting form of multiple sclerosis. In experimental 
autoimmune encephalomyelitis (EAE), an animal model of multi- 
ple sclerosis, FTY720 prevented pathogenic T cells committed to T 
helper (Ty) 1 and T17 lineages from crossing the blood-brain bar- 
rier into the central nervous system*. Down-modulation of $1PR1 in 
astrocytes by FTY720 reduced astrogliosis”, a hallmark of multiple 
sclerosis, and promoted remyelination and recovery of nerve con- 
duction. These are likely explanations for the significant decrease in 
inflammatory markers and brain atrophy after FTY720 treatment of 
patients with multiple sclerosis’. 

FTY720 and other next-generation S1PR1-modulating drugs are 
now in various stages of clinical trials for the treatment of a variety of 
immune diseases. One of the side effects of these modulating drugs 
is the transient reduction of heart rate by the activation of GIRK 
potassium channels downstream of cardiomyocyte S1PR1 (ref. 104), 
but with careful monitoring of patients after the first dose this should 
not prevent their use. FTY720 also induces pulmonary vascular leak- 
age in mice by inducing degradation of S1PR1 (ref. 105). Receptor 
pharmacology predicts that competitive S1PR1 antagonists should 
not have this adverse effect; however, they could reduce barrier func- 
tion, owing to S1PR1 blockade. So far, S1PR1 antagonists that are 
effective in various disease models, including EAE, have not been 
tested in the clinic. Even with FT Y720’s efficacy, the fact that it is 
well tolerated in patients and the convenience of oral administration, 
improvements could still be made. A deeper knowledge of S1P and 
its receptors, and an understanding of the beneficial and adverse side 
effects of targeting S1P receptors could lead to the development of 
even better treatments. 


Rheumatoid arthritis 

Rheumatoid arthritis is an autoimmune disease characterized by 
inflammation in movable joints. High levels of S1P have been found 
in the synovial fluid of patients with rheumatoid arthritis*’; however, 
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conflicting data were reported in different mouse rheumatoid arthritis 
models regarding the involvement of SphK1. Although Sphk1 knock- 
out mice showed normal responses in models of collagen-induced 
arthritis’’, synovial inflammation in TNF-induced arthritis was 
reduced”. Hence, this area of research needs to be re-examined. 

Lymphopenia-inducing drugs such as FT Y720 have potential 
as rheumatoid arthritis therapeutics, FT Y720 was protective in a 
mouse model of rheumatoid arthritis'”’; however, there are no cur- 
rent clinical trials. But, phase II clinical trials are ongoing to target 
S1P signalling by a completely different mode based on the finding 
that inhibition of SPL with the food dye THI (2-acetyl-4-tetrahy- 
droxybutylimidazole) induces lymphopenia by elevating the levels 
of S1P in lymphoid tissues, disrupting the S1P gradient lymphocytes 
use for egress’””. Subsequent studies of genetically deficient SPL mice 
revealed that they also had a disrupted S1P gradient and lymphope- 
nia. On the basis of THI structure, a second compound, LX2931, was 
developed that was effective in a mouse model of rheumatoid arthritis 
and is now in phase II trials”. It remains to be seen whether direct 
inhibitors of SPL are clinically useful. 


Inflammatory bowel disease 
Inflammatory bowel disease (IBD), a result of chronic inflamma- 
tion of the gastrointestinal tract, is due to abnormal host immune 
responses to the intestinal microbiome’”. Millions of people world- 
wide have ulcerative colitis and Crohn’s disease, the major types of 
IBD. Studies using common acute and chronic epithelial injury coli- 
tis models have underscored a role for the SphK1-S1P-S1PR1 axis in 
IBD. Knockout of the gene encoding SphK1 or specific SphK1 inhibi- 
tors greatly reduce colitis severity, cytokine production and systemic 
inflammation”. In agreement, S1P is elevated during colitis and 
SphK1 expression is increased in patients with ulcerative colitis”. 
Several studies have also shown that FT Y720 and other pharmaco- 
logical agents that target S1PR1 are very effective in rodent models 
of colitis’*’'°'"’, and several second-generation $1PR1 modulators 
are currently in clinical trials for patients with ulcerative colitis. 
Persistent inflammation of the colon increases the risk of develop- 
ing colorectal cancer’. There is increasing evidence from mouse 
models that SphK1 and S1P may bea link between chronic intesti- 
nal inflammation and the development of colitis-associated cancer 
(CAC), and that both are important for colon cancer development 
and progression’”*"*"*, An intracellular role for SphK1 in intesti- 
nal adenoma cell proliferation and polyp size was shown by dele- 
tion of SphK1 in Apc™”* mice, a model of intestinal neoplasia’. 
Upregulation of SphK1 during colitis and CAC in mice leads to 
NF-«B-dependent production of IL-6, robust activation of the tran- 
scription factor Stat3, and subsequent upregulation of its target gene, 
Slpr1. FTY720 impeded the SphK1-S1P-S1PR1 axis, suppressing 
the NF-«xB-IL-6-Stat3 vicious feed-forward amplification loop and 
CAC”. Thus, it seems that FTY720 and second-generation S1PR1 
modulators should be considered for the treatment of inflammation- 
driven cancers. 


Asthma 

Sphingolipid metabolites contribute to the pathogenesis of asthma, 
an increasingly prevalent chronic airway inflammation characterized 
by intermittent airflow obstruction and increased mucus production. 
S1P levels are elevated in the lungs of patients with allergic asthma. In 
mice, S1P is involved at multiple stages of the asthmatic response: it 
promotes the contraction of airway smooth muscle cells, the induc- 
tion of airway hyper-reactivity by S1PR1 and/or S1PR3 (refs 115, 
116), and regulates activation and functions of mast cells, eosino- 
phils and dendritic cells*”"”’. Inhalation of an SphK1 inhibitor, or 
FT Y720, mitigated asthma in rodent models””"'”. De novo ceramide 
synthesis has also been implicated in human asthma based on the 
association of a 17q21 single nucleotide polymorphism (SNP) with 
increased expression of ORMDL3 (ref. 7), a negative regulator of 
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SPT, the first enzyme in sphingolipid synthesis”'”*. Patients with this 
SNP are at increased risk of developing childhood asthma as well as 
virus-induced respiratory wheezing”. However, how ORMDL3 is 
involved in asthma remains a matter of debate. It has been shown 
that allergens upregulate ORMDL3 expression, and overexpression 
of ORMDL3 increased expression of asthma-associated chemokines 
and metalloproteases and the unfolded protein response (UPR), sug- 
gesting that ORMDL3 and asthma may be linked through the UPR 
pathway’. ORMDL3 also promotes eosinophil trafficking, recruit- 
ment and activation. By contrast, although ceramide is elevated in 
murine models of asthma’”, decreased de novo sphingolipid biosyn- 
thesis enhances bronchial reactivity by affecting intracellular mag- 
nesium homeostasis in the absence of inflammation’. Although 
these studies functionally link de novo sphingolipid biosynthesis to 
asthma, many questions remain unanswered, presenting challenging 
opportunities for research. 


Future perspectives 

The study of bioactive sphingolipids in inflammation is only now 
coming to the foreground. As always in science, answering one ques- 
tion leads to many more, and this is especially true of these enigmatic 
sphingolipids. Clearly, more work is required to understand how 
inflammation regulates bioactive sphingolipid levels, the activities 
of the enzymes that control their levels and the proteins they sig- 
nal through, and how this shapes the immune response. Although 
around a dozen extracellular and intracellular targets of signal- 
ling sphingolipids have been identified, it is unlikely that we have 
found them all, based on their pleiotropic effects. This is especially 
important for understanding the functions of ceramide in inflam- 
mation as only a few direct targets have been confirmed and how 
it regulates all inflammasomes remains a mystery. The complex- 
ity of sphingolipid metabolism and the ‘sphingolipidome; and the 
fast interconversion of bioactive sphingolipids make this task even 
more daunting. This endeavour goes hand-in-hand with current 
systems-biology research, and it will be greatly aided by advanced 
sphingolipidomics — which can quantitatively measure changes in 
sphingolipids bound to their protein targets — combined with the 
development of conditional and tissue-specific sphingolipid-meta- 
bolic-enzyme knockout mice. Moreover, current sphingolipidomic 
technology for simultaneous quantification during inflammation of 
multiple sphingolipid species, for example, ceramides with different 
fatty acids, will be particularly helpful as these signalling molecules 
could have distinct and, sometimes, even opposing functions. Such a 
diverse array of bioactive sphingolipids could provide immune cells 
with a large toolkit to fine-tune specific inflammatory responses. 
Finally, improvement of matrix-assisted laser desorption ionization 
(MALDI) technology may, in the future, allow imaging of the S1P 
gradient in tissues such as the lymph nodes, the thymus and the 
spleen; this is crucial for our understanding of the role of the $1P- 
S1PR1 axis in lymphocyte trafficking. 

Given the success of FT Y720 for treatment of multiple sclerosis, 
it is hoped that next-generation S1PR1 modulators will find even 
wider therapeutic uses in other inflammatory disorders — as has 
been suggested by pre-clinical data. Progress in targeting other S1P 
receptors will increase our basic knowledge of their functions and 
how S1P can induce pro- and anti-inflammatory responses in dif- 
ferent cell types, and will enable the rational design of more selective 
and potent inhibitors of immune responses. There are ongoing clini- 
cal trials with other interventions that target bioactive sphingolip- 
ids, such as Sphingomab, a monoclonal antibody that neutralizes 
S1P; ABC294640, which inhibits SphK2; and amitriptyline, which 
inhibits acid sphingomyelinase, and perhaps they will turn out to 
be useful for inflammatory disorders. Although such advances for 
other sphingolipid metabolites are still to be made, now that X-ray 
crystal structures of sphingolipid metabolic enzymes are available 
and carriers and targets of bioactive metabolites are gradually being 
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elucidated, it is expected that drugs will be developed to interfere with 
specific sphingolipid-mediated functions, which are important for 
the pathogenesis of human inflammatory disease. m 
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Sphingolipid lysosomal 


storage disorders 


Frances M. Platt? 


Lysosomal storage diseases are inborn errors of metabolism, the hallmark of which is the accumulation, or storage, of 
macromolecules in the late endocytic system. They are monogenic disorders that occur at a collective frequency of 1 in 
5,000 live births and are caused by inherited defects in genes that mainly encode lysosomal proteins, most commonly 
lysosomal enzymes. A subgroup of these diseases involves the lysosomal storage of glycosphingolipids. Through our 
understanding of the genetics, biochemistry and, more recently, cellular aspects of sphingolipid storage disorders, we 
have gained insights into fundamental aspects of cell biology that would otherwise have remained opaque. In addition, 
study of these disorders has led to significant progress in the development of therapies, several of which are now in routine 
clinical use. Emerging mechanistic links with more common diseases suggest we need to rethink our current concept of 


disease boundaries. 


shared fate. They undergo catabolism in late endosomes/lysosomes 

through the action of acid hydrolases. In this Review, I focus on 
sphingolipids, and in particular on glycosphingolipids (GSLs), and 
the diseases that arise through defects in their catabolism — the gly- 
cosphingolipidoses'. The glycospingolipidoses are a prototypical family 
of lysosomal storage diseases (LSDs) that share some features with other 
lysosomal disorders”. Indeed, many of the therapeutic approaches for 
treating LSDs were originally developed for treating sphingolipid lyso- 
somal diseases. 

GSLs are a complex group of sphingolipids, in which the ceramide 
backbone is modified by the addition of a carbohydrate head group. The 
simplest GSLs have a single monohexoside (glucose or galactose) moi- 
ety linked to ceramide’. Over evolutionary time the carbohydrate head 
group has increased in complexity — more than 300 structures have been 
described, with those derived from the core structure glucosylceramide 
(GlcCer) being the most common. Galactosylceramide (GalCer) and its 
sulphated derivative have a much more restricted distribution and are 
generally confined to myelin and the kidney’. When the oligosaccha- 
ride head group of GlcCer-derived GSLs contains a sialic acid, the GSL 
is charged, and these GSLs are termed gangliosides and are the major 
glycoconjugates found in the nervous system’ (Fig. 1). 

The complexity of sphingolipids is not just the result of the gly- 
can head group, but also the ceramide chain-length heterogene- 
ity that arises owing to the differential chain-length specificity of 
ceramide synthases’. 


| ipids, in common with other cellular macromolecules, have a 


Inherited diseases of GSL biosynthesis and catabolism 

Inborn errors of GSL metabolism occur in the human population 
at low frequency. They result in severe monogenic diseases, most 
typically inherited as autosomal recessive traits. For many years 
it was thought that no GSL biosynthetic diseases existed owing to 
presumed in utero lethality. Loss of GSL biosynthesis by targeted 
disruption of GlcCer synthase (that catalyses the first steps in GSL 
biosynthesis) results in lethality with a failure of embryos to develop 
beyond gastrulation in mice®. However, engineered mice that lack 
key enzymes involved in downstream steps of GSL biosynthesis, 
namely ganglioside biosynthesis, are viable’. This suggests that loss 


of the entire GlcCer biosynthetic pathway is incompatible with nor- 
mal development, but loss of certain complex ganglioside structures 
is tolerated, albeit with accompanying neurological dysfunction in 
some cases’. 


Biosynthetic diseases 

Two disorders of ganglioside biosynthesis have been genetically 
proven in the human population within the past decade, and result 
from GM3-synthase* and GM2-synthase deficiency””® (Table 1 and 
Fig. 1). GM3-synthase deficiency, which was discovered in the Old 
Order Amish community in Ohio, presents as a severe epilepsy syn- 
drome. By contrast, GM2-synthase deficiency results in spastic para- 
plegia (progressive lower limb weakness and spasticity). When GSL 
levels were measured in patient plasma, the predicted deficiencies in 
GM2 or GM3, and their downstream derivatives, were confirmed*”. 
However, levels of the precursors of GM2 or GM3 were elevated owing 
to the block in ganglioside biosynthesis*” (Fig. 1). Furthermore, GM2- 
synthase deficiency resulted in the biosynthesis of minute amounts 
ofa novel GSL structure (sialylated Gb3) that has not been previously 
reported in biological systems’. It is currently unclear whether these 
diseases are caused by the loss of the specific ganglioside species, by 
elevation in precursor GSLs or by an imbalance in the charged/neutral 
GSL ratios'’. These possibilities are not mutually exclusive. However, 
knowledge of the precise biochemical or pathophysiological basis 
for these diseases will be crucial for the development of therapeutic 
strategies for these severe, currently untreatable metabolic diseases. 
It is highly probable that these two disorders are not the only diseases 
that result from defects in complex ganglioside biosynthesis, and it 
is likely that others will be identified within the human population 
and attributed to defects in other enzymes involved in ganglioside 
biosynthesis (Fig. 1). 


Catabolic diseases 

In contrast to the paucity of GSL biosynthetic diseases, there is a rela- 
tively large group of diseases that result from defects in GSL catabo- 
lism’? (Table 1). These diseases constitute an important subgroup 
within a larger family of more than 70 LSDs. There has been debate in 
the field as to whether the term ‘storage’ is really appropriate for these 
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Figure 1 | Glycosphingolipid biosynthetic pathway. The gangliosides 
lost in GM2 synthase deficiency are indicated by the orange box and the 
gangliosides lost in GM3 deficiency are indicated by the blue box. The 
overlap of the orange and blue boxes indicates common gangliosides lost 
in both diseases. The four major gangliosides of the mammalian brain 


diseases. It perhaps over emphasizes one aspect of these complex, 
multifaceted disorders, and could be misleading when considering 
pathogenesis. However, at present, no consensus on any nomencla- 
ture change has been debated or agreed. In this Review, I therefore 
refer to LSDs when discussing monogenic catabolic diseases and use 
the term ‘lysosomal diseases’ to encompass any disease with lysoso- 
mal dysfunction, irrespective of the cause. 

Many LSDs have a progressive neurodegenerative clinical course 
and often present in infancy or childhood”’; however, symptom onset 
can occur at any stage of life and late-onset variants have also been 
described’*"'®. A major factor that has promoted our understanding 


are shown in red. The key enzymes ceramide glucosyl transferase (CGT), 
N-acetylgalactosamine transferase (GaINAcT), galactosyl transferase 
(GalT), N-acetylglucosamine transferase (GlcCNAcT), GM3 synthase 
(GM3S), iGb3 synthase (iGb3S) and lactosyl ceramide synthase (LCS) are 
shown in grey. 


of pathogenesis, and is a platform for testing therapies, is the avail- 
ability of a large number of authentic animal models for these dis- 
orders'”'*, These models span mutant and engineered mice through 
to spontaneous mutants described in species such as domestic cats, 
dogs, sheep, cattle’’, and even a flamingo with Tay-Sachs disease”. 
The availability of large and small animal models has catalysed con- 
siderable progress in our understanding of these disorders, as well as 
in the development of therapies for specific conditions” ”’. This is in 
contrast to more common sporadic neurodegenerative diseases, such 
as Alzheimer’s disease. In Alzheimer’s disease the lack of authentic 
animal models is a significant factor that contributes to the dearth 


Table 1 | Inborn errors of GLS metabolism that involve GlcCer-derived GSLs 


Metabolic defect Disease Biochemical defect Gene involved Protein defect Major symptoms CNS pathology References 
Biosynthetic GM3 synthase Loss of GM3 and ST3GAL5 GM3S Infantile onset epilepsy with Yes 8 
deficiency downstream gangliosides developmental arrest and 
blindness 
Biosynthetic GM2 synthase Loss of GM2 and BA4GALNT1 GM2S Spastic paraplegia Yes 9,10 
deficiency downstream gangliosides 
Catabolic (LSD) Gaucher types GlcCer storage GBA Glucocerebrosidase Hepatosplenomegaly, In types 2 and 3 43, 108, 
1,2and3 haematological defects, 09 
inflammation, bone disease and 
CNS disease in types 2 and 3 
Catabolic (LSD) Fabry Gb3 storage GLA a-galactosidase Renal, cardiovascular and Yes (Some 0,111 
peripheral pain mild cases of 
cerebrovasculopathy) 
Catabolic (LSD) Tay-Sachs GM2 ganglioside storage HEXA B-hexosaminidase Progressive neurodegeneration Yes 2, 
a subunit 3,114 
Catabolic (LSD) Sandhoff GM2 ganglioside storage HEXB B-hexosaminidase Progressive neurodegeneration Yes 5,116 
6 subunit 
Catabolic (LSD) GM1 GM1 ganglioside storage GLB1 B-galactosidase Progressive neurodegeneration Yes EP 
gangliosidosis 8,119 
Defect in Niemann-Pick Storage of all GSLs, NPC1 NPC1 Progressive neurodegeneration Yes 20,121 
trafficking or type C cholesterol, sphingomyelin NPC2 NPC2 
fusion and sphingosine 


CNS, central nervous system; GlcCer, glucosylceramide; GM2S, GM2 synthase; GM3S, GM3 synthase; GSL, glycosphingolipid; LSD, lysosomal storage disease. 


5 JUNE 2014 | VOL 510 | NATURE | 69 


© 2014 Macmillan Publishers Limited. All rights reserved 


REVIEW 


of treatment options available for this common disease of ageing™. 
In the remainder of this article I focus on recent advances in our 
understanding and treatment of GSL LSDs. 


GSLLSDs 


Lysosomal diseases that involve sphingolipid storage occur through two 
distinct mechanisms”. Primary sphingolipidoses are caused by inherited 
defects in genes encoding the lysosomal enzymes (or their non-enzymatic 
cofactors) required for sphingolipid catabolism in late endosomes/lys- 
osomes (Table 1 and Fig. 2). However, there are also LSDs involving sphin- 
golipid storage in which the primary defect is not in a catabolic enzyme 
but in, for example, a lysosomal membrane protein”. LSDs have brought 
to light a number of proteins with unknown function that reside in the 
limiting membrane of the lysosome”*. Frequently these diseases are asso- 
ciated with defects in trafficking and fusion in the endocytic system, lead- 
ing to secondary storage of sphingolipids”. An extreme example of this is 
Niemann-Pick type C (NPC), a disease thought to be mainly a disorder 
of cholesterol transport from the lysosome”. However, sphingolipids are 
also implicated in the aetiology of this disease because impaired sphin- 
gosine efflux from the lysosome results in incomplete filling of the acidic 
store with calcium; this, in turn, results in reduced calcium release, which 
blocks late-endosome-lysosome fusion”. This causes secondary storage 
of GSLs in late endosomes/lysosomes as well as other lipids, including 
low-density lipoprotein (LDL)-derived cholesterol”. Furthermore, the 
only approved therapy (miglustat) for NPC disease is an inhibitor of GSL 
biosynthesis”, suggesting that sphingolipids have a role in this disease. 


Current therapies for GSL LSDs 

The only potential cure for any monogenic disease is gene therapy”. 
LSDs are seen as attractive candidates for gene therapy because lysoso- 
mal enzymes are secreted and recaptured by neighbouring cells so that 
not every cell needs transducing. These diseases are therefore viewed 
as tractable for the application of this therapeutic approach. So far, this 
form of therapy has not yet reached routine clinical practice. However, 
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Figure 2 | Glycosphingolipid catabolism and associated lysosomal 
storage diseases. The enzymes that catalyse the catabolism of one 
metabolite to another are shown in red, and the diseases that result from 
defects of these enzymes are shown in blue. ASA, arylsulphatase A; GBA, 
B-glucocerebrosidase; GLA, a-galactosidase; GLB, B-galactosidase; HEXA, 
B-hexosaminidase A; HEXB, B-hexosaminidase B; NA, neuraminidase; 
SMase, acid sphingomyelinase. 
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proof-of-concept studies have shown efficacy in authentic small- and 
large-animal disease models*”**, leading to clinical trials (Table 2). If 
these trials are successful, the challenge will be to intervene early, before 
irreversible pathology has occurred’””*. Another key factor is that the 
gene delivery system needs to correct both peripheral tissues and the 
central nervous system (CNS) because progressive brain pathology 
occurs in most of these diseases, often accompanied by visceral disease. 
Intravenous delivery of viral vectors could ultimately prove to be the 
optimal way of broadly distributing corrective gene therapy to all areas 
of the brain, as well as transducing peripheral organs. Elegant proof-of- 
concept studies in several disease models in mice have demonstrated the 
feasibility of intravenous gene delivery for treating the CNS, peripheral 
nervous system (PNS) and non-neuronal tissues”. However, ongoing 
or planned clinical trials all use, or will use, more-invasive approaches 
with direct delivery of vectors to the brain. Should efficacy result, less- 
invasive delivery methods will no doubt be explored in the future”. 

The next step in the pathogenic cascade, downstream of the genetic 
defect, is the defective protein. LSDs caused by enzyme deficiencies can 
be amenable to enzyme replacement therapy (ERT) using recombinant 
forms of wild-type enzymes delivered through frequent intravenous 
infusions”. Gaucher disease is a prototypical sphingolipid storage dis- 
ease caused by inherited defects in the gene encoding glucocerebrosi- 
dase, leading to storage of its substrate GlcCer®’ (Fig. 2). There are three 
forms of Gaucher disease: type 1 involves peripheral tissues and organs, 
type 2 is acute neuronopathic and type 3 can be both neuronopathic and 
involve peripheral tissues. Type 1 Gaucher disease is characterized by 
hepatosplenomegaly, haematological defects and bone disease“*. This 
was the first disease to benefit from ERT, and the therapy was the first 
to be commercialized, at that time using a non-recombinant purified 
enzyme’. Despite the remarkable efficacy of ERT in treating patients 
with type 1 Gaucher disease, lysosomal enzymes do not cross the blood- 
brain barrier, and so cannot be used to treat CNS manifestations in 
patients with type 2 and type 3 disease. The success of ERT in type 1 
Gaucher patients catalysed the development of ERT for several other 
LSDs, including another sphingolipid storage disease, Fabry disease”. 
Commercially, ERT showed that the development of therapies for rare 
diseases could be highly profitable, which has catalysed the development 
of a number of competing ERT products from multiple companies*’. 
The high costs of ERT and other treatments for rare diseases limits 
their global access and raises the spectre of health-economics arguments 
about cost effectiveness”. Owing to low patient numbers, therapies that 
could be used in multiple LSDs would be attractive. One experimental 
therapy that has shown broad benefit in in vitro models of LSDs is heat 
shock protein 70 (HSP70), which stabilizes lysosomal membranes and 
promotes clearance of storage by an incompletely understood mecha- 
nism*“’. This biologic therapy is currently in preclinical development. 

The only other approved therapy for LSDs that involve GSL stor- 
age is an oral, small-molecule approach termed substrate reduction 
therapy (SRT)”, of which miglustat is an example. GSL biosynthesis is 
inhibited by this drug, reducing the number of GSLs requiring lysoso- 
mal catabolism, thereby reducing storage”’. Miglustat is approved for 
the treatment of type 1 Gaucher disease”, as well as for NPC disease. 
For patients with Gaucher disease, miglustat offers an oral alternative 
for those who are unable or unwilling to receive intravenous ERT”. 
Miglustat is currently the only specific, approved disease-modifying 
drug for NPC disease™. It crosses the blood-brain barrier and slows 
progression of CNS disease”’. Early intervention has the most clinical 
benefit and prolongs life by reducing aspiration pneumonia, owing 
to improved swallowing”. NPC disease is not caused by an enzyme 
deficiency, but most commonly by mutations in a multimembrane 
spanning transporter called NPC1, which is located in the limiting 
membrane of the lysosome”. New strategies to treat NPC will require 
the downstream steps in the pathogenic cascade (Fig. 3) to be identi- 
fied and targeted; this disease is not currently a promising candidate 
for gene therapy because cells are not amenable to cross-correction 
by transduced cells™. 
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Table 2 | Approved therapies and those in clinical trials for lysosomal storage diseases 


Therapeutic approach Disease Status Approved product or ClinicalTrials.gov identifier 
ERT Gaucher disease* Approved Cerezyme 
VPRIV 
Elelyso 
Fabry* Approved Fabrazyme 
Replagal 
MPS | Approved Aldurazyme 
MPS Il Approved Elaprase 
MPS VI Approved aglazyme 
Pompe Approved yozyme 
PB Clinical trials CT02004704 (acid sphingomyelinase) 
SRT Type 1 Gaucher disease* Approved iglustat 
PC* Approved (not by FDA) iglustat 
SRT Type 1 Gaucher disease* Ongoing clinical trials CT00358150 (eliglustat) 
Enzyme enhancement (chaperone therapy) Fabry* Ongoing clinical trials CT01458119 (migalastat) 
Pompe Ongoing clinical trials CT01380743 (duvoglustat) 
Gene therapy Tay-Sachs* Trials pending or recruiting CT01869270 
BMT Multiple In clinical practice /A 


*Indicates sphingolipid storage diseases. BMT, bone marrow transplantation; ERT, enzyme replacement therapy; MPS, mucopolysaccharidosis; NPB, Niemann-Pick type B; SRT, substrate reduction therapy. 


The benefits and challenges of combination therapy 

The concept and application of combination therapy is well established 
in medicine and involves drugs that target two or more unique steps ina 
disease process to maximize the clinical benefit for the patient (additive 
or synergistic benefit). A complex disease such as rheumatoid arthritis is 
a good example of one in which multiple, independent steps in inflam- 
matory cascades are targeted with small molecules and biologics”. In 
LSDs, in which there is a monogenic cause, targeting the gene or defective 
protein has logically driven most of the therapeutic efforts for the past 
two decades*”*. However, gene therapy is not yet in routine clinical use 
for these diseases and, because ERT cannot target the brain, this form of 
therapy is limited to the management of peripheral organ dysfunction. 
Several factors are directing new approaches to therapy. We now have 
much more insight into the pathogenesis of LSDs: for example, we know 
that dysregulated inflammation is a downstream consequence of lyso- 
somal storage in multiple sphingolipid LSDs’. The mechanisms leading 
to dysregulation of inflammatory signalling remain unclear; however, 
it can be treated with existing cheap and effective drugs. For example, 
studies conducted in some animal models have shown that non-steroidal 
anti-inflammatory drugs (NSAIDs) have modest, but significant, benefits 
when used as a monotherapy *". However, synergy has been achieved 
when anti-inflammatory drugs have been combined with other treat- 
ments, including bone-marrow transplantation and SRT™. The other 
factor driving combination-therapy approaches is economic. Specific 
therapies developed for treating LSDs (for example ERT and SRT) are 
extremely expensive, so combination therapy that uses multiple high- 
cost drugs would be unrealistic. There is therefore an urgent need to 
identify steps in the pathogenic cascade so that they can be targeted with 
pre-existing, cheaper medicines, the safety profiles of which are already 
known, thereby allowing rapid translation into the clinic. 


New developments in our understanding of LSDs 

Several important developments are shedding light on aspects of lyso- 
some biology, LSD pathogenesis and new approaches to treatment. The 
first of these addresses a fundamental question: are lysosomal proteins 
coordinately regulated? Using a bioinformatics approach, the CLEAR 
(coordinated lysosomal expression and regulation) network of genes was 
identified, demonstrating that genes encoding proteins involved in lyso- 
somal function have coordinated transcription controlled by a master 
regulator, the helix-loop-helix transcription factor TFEB®. A tran- 
scriptional regulator, ZKSCAN3, with opposing activity has also been 
identified, providing insight into the negative regulation of lysosome 
biogenesis™”. These key studies demonstrated that lysosome biogenesis 
and function are tightly regulated and imply that TFEB might havea role 
in LSDs in which storage triggers expansion of the lysosomal system. 


Furthermore, the characterization of this regulated gene network offers 
new approaches to control lysosomal biogenesis and potentially increase 
flux through the lysosomal system, which could have therapeutic poten- 
tial in a number of lysosomal disorders. 

Lysosomes, in addition to playing a central part in macromolecule 
catabolism, also mediate plasma-membrane repair™”. During the pro- 
cess of lysosome-cell-membrane fusion, exosomes are released; TFEB 
activation has been shown to greatly enhance this process in a mucolipin- 
1-dependent manner”. MCOLN1 encodes mucolipin-1, which, when 
defective, causes mucolipidosis type IV, another LSD. The function of 
mucolipin-1 is incompletely understood, but has been implicated in the 
regulation of aspects of acidic store calcium homeostasis, suggesting 
TFEB requires lysosomal calcium for its activation or translocation”. 

TFEB activation could therefore offer a practical means to remove 
storage vesicles by promoting exocytosis. A proof-of-concept study has 
shown the benefit of this approach in a mouse model of multiple sul- 
phatase deficiency, in which overexpression of TFEB in vivo led to reduced 
glycosaminoglycan storage and reduced inflammation”. Drugs that can 
differentially regulate TFEB expression, and potentially other regulators 
oflysosome function, could therefore represent a new class of therapeutic 
agent for treating LSDs in the future. 

In the context of sphingolipid LSDs, the question is whether this 
approach would reduce stored insoluble lipids (in contrast to glycosami- 
noglycans). This is because the storage burden released by one cell in the 
form of exosomes might be taken up by macrophages, and potentially 
other immune cells, which can internalize exosome-sized particles and 
redistribute the lipid. However, for those diseases with neuropathology, 
lipid redistribution from the CNS to peripheral tissues could still be of 
clinical benefit’. Interestingly, a new therapeutic approach for the treat- 
ment of patients with NPC disease is the use of 8-cyclodextrin, based on 
efficacy in murine and feline models of the disease’””*. B-cyclodextrin 
has the ability to promote exocytosis by signalling plasma-membrane 
damage in NPC cells in vitro”. Early intervention with B-cyclodextrin in 
mice cleared storage lipids (sphingosine, GSLs and cholesterol) from the 
brain, despite a lack of evidence that it can cross the blood-brain barrier”. 
Peripheral organs such as the liver also had reduced storage levels, except 
for the lung in which foamy lipid-laden macrophages were found, suggest- 
ing lipid redistribution to pulmonary macrophages’. When a small num- 
ber of patients with NPC disease were given §-cyclodextrin in individual 
investigational new drug studies (in the United States, Spain and Brazil), 
stored lipids were found to redistribute to circulating B cells, possibly as 
a result of exosome uptake”. Clearly, careful evaluation of the impact of 
stimulating exocytosis as a therapeutic approach in the sphingolipidoses 
will be an important step towards understanding its therapeutic potential 
in LSDs involving different biochemical classes of stored metabolites. 
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Figure 3 | The pathogenic cascade and therapeutic approaches. Defects 
in individual genes and proteins lead to the storage of substrates. Through 
a complex downstream pathogenic cascade, the steps of which are still to be 
determined, this storage leads to dysfunction and death at the cellular level, 
and to symptomatic disease. Treating disease requires an understanding of 
these downstream steps and the points at which therapeutic intervention is 
possible. Only bone marrow transplantation (BMT), enzyme replacement 
therapy (ERT), substrate reduction therapy (SRT) and symptomatic 
management are approved or in routine clinical practice. 


Unusual mechanisms of cell death in some LSDs 

Most LSDs involve progressive neurodegeneration, it is therefore of 
great importance to understand cell-death pathways, which represent 
another potential therapeutic target. Several studies in mouse models of 
LSDs involving ganglioside storage (for example, Sandhoff disease and 
GM1 gangliosidosis) have provided evidence for a form of programmed 
cell death called caspase-dependent apoptosis”. 

However, another study* ° has uncovered an unusual cell-death mech- 
anism in two sphingolipid storage diseases, type 2 Gaucher disease and 
Krabbe disease. These diseases share a common biochemical hallmark, 
the storage of simple monohexosyl GSLs. Type 2 Gaucher disease is a 
severe neuronopathic form of the disease, leading to death in infancy; 
and is refractory to ERT because the disease manifests in the brain. 


BOX1 
Necroptosis in Gaucher 
and Krabbe disease 


@ Is necroptotic cell death restricted to lipid storage diseases that 
involve storage of monohexosyl ceramides? 

@ If necroptotic cell death is selective for monohexosyl ceramides, 
what steps in the necroptosis pathway do these lipids target? 

@ Where in the cell do storage lipids intersect with the necroptotic 
cell-death pathway? 

@ What level of glucosylceramide (GlcCer) or galactosylceramide 
(GalCer) is required to trigger necroptotic cell death? 

@ Why is necroptosis not a feature of Niemann-Pick type C (NPC) 
with low levels of GlcCer storage? Is it because GlcCer levels 

are lower or because these is a lack of GlcCer transfer to other 
organelles or membranes in NPC disease cells? 

@ Is necroptotic death involved in other neurodegenerative diseases 
(such as Parkinson’s disease) in which the mechanism of cell death 
is still unclear or controversial? 

@ How easy will it be to generate selective inhibitors of RIPK3 to test 
in animal models and potentially in the clinic? 
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Type 2 Gaucher disease results from a profound loss of glucocerebro- 
sidase activity (owing to mutations in GBA) that leads to storage of 
GlcCer and its lyso-derivative glucosylsphingosine. Krabbe disease is 
another severe neurodegenerative LSD caused by storage of a different 
monohexosyleceramide, the galacto-glycoconjugate GalCer, and its 
lyso-derivative psychosine*”. 

In addition to apoptosis, which is perhaps the most intensively 
researched form of programmed cell death®’, necroptosis** is also a 
major programmed cell-death pathway. This pathway is dependent on 
the kinases RIPK1] (ref. 86) and RIPK3, whereby RIPK3 (ref. 87) medi- 
ates caspase-independent cell death. Futerman and colleagues found 
that in the brains of mice that were deficient for Gba in neural and 
glial progenitor cells (a model of acute neuronopathic Gaucher disease) 
there was an absence of TUNEL positive cells, which are indicative of 
apoptotic cell death, even at time points when neurodegeneration was 
advanced, indicating an absence of caspase activity. Furthermore, cas- 
pase expression was normal throughout the disease course, support- 
ing a caspase-independent mechanism of neuronal cell death. Levels 
of RIPK1 and RIPK3 were found to be significantly elevated in these 
mice, and, in the same study, RIPK1 was also found to be increased in 
the post-mortem brain of a patient with type 2 Gaucher disease. In the 
mice, RIPK3 was elevated in both microglia and in neurons, suggesting 
a contribution to neuroinflammation and neuronal cell death, respec- 
tively. Furthermore, the levels of RIPK3 correlated with the extent of 
pathology in different regions of the brain. The necroptosis pathway was 
not activated in mouse models of other LSDs, including NPC disease, 
Sandhoff disease and GM1 gangliosidosis. However, it was activated in 
Krabbe disease. 

The authors also tested the therapeutic potential of targeting necrop- 
tosis by pharmacologically inducing neuronopathic Gaucher disease 
(using conduritol B epoxide, CBE; an irreversible inhibitor of glu- 
cocerebrosidase) in Ripk3-deficient mice. Ripk3 heterozygous mice 
survived for 20-30 days when injected with CBE, whereas the lifespan 
of Ripk3-deficient mice treated with CBE was extended to 100-160 days, 
thereby demonstrating the therapeutic benefit of targeting RIPK3 activ- 
ity in vivo. The development of pharmacological inhibitors of RIPK3 
activity should facilitate clinical investigations of this novel therapeutic 
approach for these two currently intractable LSDs. Answering some of 
the questions raised by this study (Box 1) will help us to understand 
how some storage lipids lead to necroptosis and to define the range 
of diseases that could benefit from treatment with RIPK3 inhibitors. 


Effects of lipid storage beyond the lysosome 

The study of necroptotic pathways in the GlcCer and GalCer storage 
diseases discussed prompts another question, namely, where in the cell 
do the storage lipids affect cellular function, leading to pathology? In 
an Npcl mouse model, no elevation of Ripk1 or Ripk3 was observed 
in the brain®”. This is surprising because GlcCer is one of the GSLs 
stored in NPC, albeit at lower levels than in neuronopathic Gaucher 
disease. In most clinical cases, NPC is caused by inherited defects in 
the integral lysosomal membrane protein, NPC1. Based on the protein's 
homology to the bacterial RND permease family, it probably facilitates 
the efflux of multiple substrates out of the lysosome””**”. In NPC1- 
deficient cells, late-endosome-lysosome fusion is blocked” as a result 
of reduced calcium release” from the lysosome, so that accumulating 
lipids are trapped in the late endosomes and cannot move from one 
membrane to another due to defective trafficking”. This suggests that 
the stored monohexosylceramides that trigger necroptosis in Gaucher 
and Krabbe disease might do so by acting outside the lysosome. The rea- 
soning behind this speculation is that GlcCer is also stored in NPC dis- 
ease, but is retained in late endosomes/lysosomes as a result of blocked 
late-endosome-lysosome fusion. This concept is reminiscent of previ- 
ous studies that implicated defective calcium homeostasis in LSDs”’. 
GlcCer accumulation in Gaucher disease, GM2 ganglioside storage in 
Tay-Sachs and Sandhoff diseases, and GM1 storage in GM1 gangliosi- 
doses were all found to cause dysregulated endoplasmic reticulum (ER) 
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calcium homeostasis through unique mechanisms that lead to elevated 
levels of cytosolic calcium. In GM1 gangliosidosis cells, it was found that 
stored lipids escape the lysosome and are present in ER membranes, 
hence their impact on ER calcium homeostasis”. These data suggest that 
there is lipid exchange from the lysosome to other membranes, includ- 
ing the ER. By contrast, NPC disease, in which GM2 ganglioside and 
GlcCer accumulate, has normal ER calcium homeostasis, presumably 
as a consequence of the block in late-endosome-lysosome fusion that 
prevents lipids stored in late endosomes from moving into other mem- 
branes”. This highlights the significance of not only the biochemical 
nature of the stored material and its overall levels but also its intracel- 
lular distribution. It is conceivable that in several LSDs it is the storage 
lipids in compartments other than the lysosome that contribute to key 
aspects of pathogenesis” : 


Unexpected links to Parkinson’s disease 

As more genetic studies have been conducted to elucidate pathways 
that are involved in neurodegenerative diseases, a common theme 
has emerged, namely defective endosomal/lysosomal transport is a 
common, shared feature”. The mechanisms that cause neurons to 
malfunction and ultimately die when the function of late endosomes 
or lysosomes is defective remain obscure. However, it is clear that 
they have a greater dependency on correct trafficking and endocytic 
system function than many other cell types”. This could reflect their 
unique anatomical, metabolic and/or functional characteristics, but 
at present we lack a good mechanistic understanding of their differ- 
ential vulnerability. This topic has recently been discussed in detail in 
an excellent review”, as has the dysregulation of autophagy, another 
common feature of neurodegenerative diseases”. If we want proof of 
the importance of normal lysosomal function for neuronal health, 
we need look no further than the LSDs, in which defects in many 
different, unique aspects of late-endosome or lysosome function 
commonly result in a relentless neurodegenerative clinical course”. 
Historically, LSDs have been studied as discrete metabolic diseases 
in their own right. However, in the past few years some unanticipated 
links with more common diseases have emerged, placing the lyso- 
some at centre stage in a range of severe human diseases. Probably 
the most extensively researched example is the link between Gaucher 
and Parkinson's disease”’. 

Cases of patients with Gaucher disease who had parkinsonian 
symptoms were reported in the late 1990s”*””. The clinical presenta- 
tion of parkinsonian symptoms in one study cohort was bilateral aki- 
netic rigidity and poor response to levodopa treatment’, but there 
was variability in clinical presentation and age of onset'’'. Because 
having a rare monogenic disease is unfortunately not a barrier to 
developing more common diseases, this could simply have reflected 
chance. However, over time, more cases were reported”, and it was 
observed’ that the frequency of Parkinson’s disease in unaffected 
family members of patients with Gaucher disease (some of whom 
were carriers of the GBA gene mutation) was higher than would be 
expected in the general population. Genetic studies to determine 
whether there is a significant link between Gaucher and Parkinson's 
disease showed that carrying a GBA mutation was the highest risk fac- 
tor for developing Parkinson’s, elevating risk fivefold for Parkinson’s 
and eightfold for Lewy body dementia, a condition associated with 
Parkinson’s*'*’. Moreover, heterozygous carriers of GBA mutations 
that are responsible in homozygotes for the development of Gaucher 
disease seem to be affected by parkinsonism with equal frequency, 
strongly suggesting the presence of a ‘toxic allele’ effect. A number 
of interesting questions arise from these studies of the mechanism 
linking Gaucher and Parkinson's (Box 2). Anecdotal reports have 
also described Parkinson's disease in patients with other LSDs’”’. 
Additional genes involved in lysosomal function have been linked to 
Parkinson's disease and have been reviewed'”’. These findings suggest 
that lysosomal dysfunction might be linked to other neurodegenera- 
tive diseases. This will no doubt be a hot topic of research in the field 
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BOX 2 
The link between Gaucher 
and Parkinson’s disease 


@ How does carrying one mutant GBA allele increase the risk of 
Parkinson’s disease? 

@ Most mutant GBA alleles that are associated with Parkinson’s 
disease lead to misfolded proteins, so is there a gain-of-function 
mechanism? 

@ Does mutant glucocerebrosidase enhance a-synuclein 
aggregation and, if so, how? 

@ Does the GBA mutant allele impair lysosomal function, 
overloading other pathways, including the proteasome pathway? 

@ Does the GBA mutation in the heterozygous state lead to altered 
glucosylceramide metabolism in certain specialized cell types, such 
as dopaminergic neurons? 

@ Is it possible that necroptotic cell death in neuronopathic Gaucher 
disease also occurs in Parkinson’s disease? 

@ Why do most individuals with a mutant GBA allele not develop 
Parkinson’s disease, and what are the other risk factors? 


for years to come, and I predict that this is the tip of the iceberg in 
terms of links between lysosomal dysfunction and more-common 
human diseases. 


Future directions 

Remarkable progress has been made in our understanding of classical 
LSD, which has led to a number of approved therapies that are improy- 
ing the quality of life for patients. Treating the brain remains an unmet 
clinical need, and innovative experimental therapies that cross the 
blood-brain barrier or are delivered directly to the CNS are currently 
under investigation. There is also a growing appreciation of the benefits 
of combination therapy and the part it will play in the clinical manage- 
ment of patients in the future. A better understanding of the patho- 
genic cascade in these diseases must be a priority, because this will no 
doubt identify adjunctive therapies that target novel clinical interven- 
tion points, potentially with approved drugs, thereby speeding the path 
to translation to the clinic. The unanticipated links between Gaucher 
and Parkinson's disease are provoking a re-think of lysosomal disorders 
beyond inborn errors of metabolism. How many other more-common 
diseases involve lysosomal dysfunction? Answering this question will 
have therapeutic ramifications because therapies developed to treat 
rare monogenic diseases might have unanticipated uses in much more 
common diseases that also involve lysosomal dysfunction. Therefore, 
paradoxically, lysosomal diseases could be much more common than 
we currently think. Knowledge gained from studying rare monogenic 
LSDs could help us to unravel disease mechanisms of sporadic polygenic 
diseases such as Parkinson's that may also involve lysosomal dysfunc- 
tion. Understanding disease mechanisms in the GSL metabolic diseases 
will also provide fundamental insights into the biological roles of this 
enigmatic family of lipids. = 
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The different shades of fat 


Vivian Peirce’, Stefania Carobbio’” & Antonio Vidal-Puig!” 


Our understanding of adipose tissue biology has progressed rapidly since the turn of the century. White adipose tissue has 
emerged as a key determinant of healthy metabolism and metabolic dysfunction. This realization is paralleled only by the 
confirmation that adult humans have heat- dissipating brown adipose tissue, an important contributor to energy balance 
and a possible therapeutic target for the treatment of metabolic disease. We propose that the development of successful 
strategies to target brown and white adipose tissues will depend on investigations that elucidate their developmental 


origins and cell-type-specific functional regulators. 


view that white adipose tissue (WAT) is a passive storage organ for 

excess calories has been revised. WAT does have an important role 
in buffering nutrient availability and demand by storing excess calories 
and preventing the toxic accumulation of excess nutrients in non-adipose 
tissues’; however, WAT also communicates with metabolically relevant 
organs by secreting so-called adipokines as part of a dynamic endocrine 
system that regulates nutrient partitioning into peripheral tissues’. 

The recent confirmation that adult humans have brown adipose tissue 
(BAT) has transformed our understanding of how adipose tissue regulates 
metabolism and energy balance once again* *. The primary function of 
BAT is to maintain core body temperature in response to cold stress by 
generating heat, a process known as thermogenesis’. Like the unilocular 
white adipocytes in WAT, the multilocular brown adipocytes in BAT also 
accumulate and store lipids’. However, brown adipocytes are distinct from 
white adipocytes in that their more abundant mitochondria are enriched 
with uncoupling protein 1 (UCP1), which uncouples substrate oxidation 
from ATP production so that heat is produced’. 

The fact that heat production by BAT is an extremely energy-expensive 
process that burns nutrients has two important caveats. First, to avoid 
hyperthermia and to regulate energy usage by BAT, thermogenesis in 
this tissue must be tightly controlled so that heat is specifically produced 
in response to thermogenic stimuli. Second, when stimulated to produce 
heat, the so-called activated BAT has an enormous impact on energy bal- 
ance, at least in rodents. For example, mice housed at room temperature 
(20-22°C) would need to consume 60% more food to maintain their 
body weight compared with mice housed at thermoneutral temperatures 
(30°C) to compensate for the increased energy expenditure as a result of 
increased thermogenesis’”. Conversely, even at thermoneutrality, BAT 
ablation or dysfunction in rodents decreases energy expenditure, causing 
an obese phenotype’. In addition to its thermogenic effects, activated 
BAT has a large capacity for glucose and lipid uptake per gram of tissue, 
as demonstrated by its ability to normalize hyperglycaemia and hyperlipi- 
daemia in mouse models of diabetes and dyslipidaemia’*”*. In line with its 
remarkable capacity for substrate oxidation, BAT is activated in rodents 
in response to excess nutrient consumption, such as eating a high-fat diet, 
a process known as diet-induced thermogenesis’. Therefore, although 
WAT far outstrips BAT in terms of percentage body mass, activated BAT 
is an important contributor to nutrient partitioning and utilization, and 
body weight regulation. 

The revelation that adult humans possess such a potent metabolic tissue 
was accompanied by the discovery that UCP1-expressing thermogenic 
adipocytes can also be found in WAT in the form of beige adipocytes'™”. 


E the face of the modern obesity epidemic the previously prevailing 


In rodents, chronic cold exposure increases thermogenic capacity by 
increasing the number of brown adipocytes, thereby recruiting BAT 
mass’. As part of the same response, so-called beige or ‘brite (brown in 
white) cells are also recruited in WAT, resulting in WAT ‘browning”*”. 
Although there is significant overlap between beige and brown adipocytes 
in their expression of UCP1 and the genes required for thermogenesis, 
beige adipocytes have distinctive gene expression signatures, indicating 
that they are a distinct type of thermogenic cell'*'. Unstimulated rodent 
beige adipocytes are similar to rodent white adipocytes in terms of oxygen 
consumption rates and uncoupled respiration in vitro'*. Compared with 
brown adipocytes, however, beige adipocytes have a greater capacity for 
thermogenic stimulation, with greater inducibility of UCP1 expression 
and proportionally increased uncoupled respiration’®. UCP1 expression 
and uncoupled respiration in stimulated beige adipocytes could equal 
that of stimulated brown adipocytes’*”’. In summary, when consider- 
ing adipose tissue involvement in the regulation of energy balance, the 
contribution of three different ‘shades’ of fat — white, brown and beige 
— should be considered. 

The discovery of these different shades of fat has important impli- 
cations when considering the pathogenesis of, and treatments for, the 
obesity-related metabolic complications that characterize metabolic syn- 
drome. WAT dysfunction is a key pathological phenomenon contributing 
to metabolic syndrome. Although WAT is able to functionally adapt and 
expand in response to chronic calorie overconsumption, individuals seem 
to have a set functional limit beyond which WAT fails to function prop- 
erly as a storage and endocrine organ”. This can result in the lipotoxic 
accumulation of lipids in peripheral tissues that contributes to insulin 
resistance and dyslipidaemia”’. Conceptually, this lipotoxicity could be 
ameliorated either by increasing the capacity of WAT for nutrient storage 
and/or by eliminating these excess calories by increasing the activity of 
BAT. In our opinion, the development of successful adipose-tissue-based 
therapeutic strategies to treat metabolic syndrome is reliant on a good 
understanding of basic adipose-tissue biology. We will discuss how rodent 
models have deepened our understanding of the developmental origins 
and cell-type-specific functional regulators of white, brown and beige 
adipocytes, and the current big challenge to translate this information 
from rodents to humans. 


The origins of adipocytes in rodents 

Following their establishment in early life, adipose-tissue depots undergo 
active remodelling throughout adulthood. Both WAT and BAT have the 
capacity to expand in response to calorie excess and chronic cold stress, 
respectively. To enable this expansion, it is important that precursor 
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cells with adipogenic potential are retained in adulthood”. We believe 
that understanding how the lineage origins of these adipocyte precur- 
sors biases their potential to adopt a brown-, beige- or white-cell fate on 
differentiation (reviewed in ref. 26) is essential for the development of 
therapeutic approaches that target these cells. 


Embryonic and adult origins of adipocytes 
Lineage-tracing studies have shown that both brown adipocytes and 
myocytes derive from MYF5* and PAX7’* progenitors that originate 
in the paraxial mesoderm, and that brown adipocytes can also derive 
from stem-cell-like skeletal muscle satellite cells in adulthood” * (Fig. 1). 
Moreover, brown adipocyte precursors express muscle-specific tran- 
scription factors, and it is known that expression of some myogenic 
microRNAs (miRNAs)persists in mature brown adipocytes*’”’. Given 
the absence of this myogenic gene expression signature in white adipo- 
cytes and their precursors, it was expected that white adipocytes would 
derive preferentially from a MYF5" lineage”. However, this view was 
recently challenged when the conditional depletion of PTEN driven by 
MYF5-Cre caused overgrowth of BAT, but also a paradoxical overgrowth 
of specific WAT depots and a loss of others**. Subsequent lineage-tracing 
studies have demonstrated the presence of some MYF5" adipocyte pre- 
cursors in WAT, indicating that white adipocyte precursors can derive 
from both MYF5* and MYF5’ lineages**™*. Unexpectedly, lineage-tracing 
studies using the endothelial marker vascular endothelial cadherin or 
the pre-adipocyte marker Zfp423 also suggested that some brown and 
white adipocytes could have endothelial origins, despite a recent study 
finding that multiple endothelial markers did not label adipocytes***”. 
Given that beige adipocytes can be derived from white adipocyte pre- 
cursors in vitro by chronic treatment with PPAR-y agonists, it is likely 
that beige adipocytes share the MYF5 lineage origin of most white adi- 
pocytes”’**”*». In terms of adult origins, recent evidence suggests that 
beige adipocytes can form either directly from white adipocytes or due 
to the proliferation of precursors'***. For example, using mice in which 
UCP1-expressing cells are labelled both permanently and transiently 
with fluorescent proteins, beige adipocytes recruited by 7 days of cold 
exposure were found to arise directly from white adipocytes”. Interest- 
ingly, these cells return to a white phenotype after warm adaptation, 
indicating that this interconversion is likely to be bidirectional”. How- 
ever, contrasting results have been found using a mouse model in which 
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Figure 1 | Origins of white, beige and brown adipocytes. Brown adipose 
tissue (BAT) is characterized by UCP1* brown adipocytes, whereas white 
adipose tissue (WAT) contains a mixture of UCP1 white adipocytes 

and UCP1* beige adipocytes. Adipose tissue expansion in adulthood is 
achieved mainly by the proliferation and differentiation of adipocyte 
precursors to produce new adipocytes. WAT and BAT adipocyte 
precursors derive from mesenchymal precursor cells. Recent evidence 
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existing adipocytes can be labelled with B-galactosidase (B-gal) using 
doxycycline, so that any adipocytes generated de novo after doxycycline 
treatment are B-gal” (ref. 41). After doxycycline treatment, 3 days of cold 
exposure induces the appearance of multilocular B-gal adipocytes in 
subcutaneous WAT, suggesting that these are beige adipocytes produced 
by clonal expansion ofa precursor“. This is consistent with results from 
another study that tracked beige adipogenesis using bromodeoxyuridine 
accumulation to show that beige adipocytes arise de novo in WAT in 
response to adrenergic stimulation”. Subsequent lineage tracing studies 
indicated that self-renewing PDGFR-a* precursors were a significant 
source of these newly formed beige adipocytes”. These are ‘bipotential’ 
precursors because they are able to produce both beige and white adipo- 
cytes when cultured in vitro”. In addition to potentially deriving from 
precursors with both white and beige adipogenic potential, beige adipo- 
cytes may derive from dedicated beige adipocyte precursors. In a study 
characterizing the in vitro adipogenic potential of immortalized WAT 
and BAT precursors, some immortalized WAT precursors differentiated 
preferentially into beige adipocytes'®. These results suggest that differ- 
ent types of adipocyte precursors in WAT may vary in their potential 
to produce beige adipocytes, perhaps due to their lineage origins. For 
example, PAX3 adipocyte precursors and MYF5 adipocyte precursors 
isolated from WAT differentiate into adipocytes with higher expression 
levels of thermogenic genes compared with PAX3* precursors or MYF5* 
precursors, respectively’. Taken together, these observations suggest 
that the transcription factor networks that drive adipocyte differentia- 
tion towards the brown-, beige- or white-cell fate might be affected by 
the specific lineage origins of precursor cells. 


Key factors driving adipogenesis 

Despite the differences in the embryonic origins and physiological 
functions of brown and white adipocytes, both cell types share a simi- 
lar transcriptional cascade that controls terminal differentiation into 
mature adipocytes (reviewed in ref. 44). PPAR-y and three CCAAT/ 
enhancer-binding protein family members (C/EBP-a, C/EBP-B and C/ 
EBP-6) have a role in both white and brown adipogenesis by cooperat- 
ing to promote a transcriptional cascade that leads to and maintains the 
stable differentiation of the adipocyte™. This means that, for example, 
selective recruitment of thermogenic adipocytes requires the identifica- 
tion of specific factors that robustly drive a thermogenic brown or beige 
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suggests that WAT adipocyte precursors can derive from both MYF5* and 
MYF5 lineages, whereas BAT adipocyte precursors derive exclusively 
from a MYF5*' lineage. Beige adipocytes can derive from WAT adipocyte 
precursors and potentially directly from mature white adipocytes. Brown 
adipocytes can also be generated from stem-cell-like skeletal muscle 
satellite cells, and both brown and white adipocytes may derive from 
endothelial precursors. 
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Figure 2 | Factors regulating white, and brown or beige adipogenesis. 
Adipogenesis is driven by a cascade of transcription factors (C/EBP-5, C/EBP- 
6, C/EBP-a, PPAR-y, PGC1-a and SREBP Ic) that establish and maintain the 
stable differentiation of the adipocyte. This transcriptional cascade interacts 
with other transcription factors, co-activators and microRNAs during the 
decision to proliferate or differentiate and the decision to adopt a brown 

or beige ‘thermogenic’ cell fate or a white ‘non-thermogenic’ cell fate. For 
example, miR-155 is involved in a feedback loop with C/EBP- that regulates 
the decision to proliferate or differentiate. During adipogenesis, PPAR-a, 
PGC- 1a, SRC-1, PRDM16, miR-196a and miR-193b-365 drive a thermogenic 
brown or beige phenotype, whereas RIP140, TIF2, miR-133 drive a white 


phenotype. 


phenotype during adipogenesis. Recent progress using rodent models 
suggests that determining the brown-, beige- or white-cell fate during 
adipogenesis requires three levels of control involving transcription fac- 
tors and co-activators, epigenetics and miRNAs (Fig. 2). 

Although PPAR-y is a key driver of the general adipogenic pro- 
gramme, treatment of brown adipocyte precursors with PPAR-y agonists 
potentiates brown adipogenesis and the same treatment biases murine 
and human white adipocyte precursors towards a beige-cell fate*”***”. 
Other transcription factors and co-activators are known to drive adi- 
pogenesis towards a white or brown fate. For example, the corepres- 
sor RIP140 and the co-activator TIF2 help to drive white adipogenesis, 
whereas the transcription factors PPAR-a and FOXC2 and the PPAR-y 
co-activators PGC-1a, SRC1 and PRDM16 promote brown adipogen- 
esis“. Despite PGC-1a’s importance in regulating the expression of 
thermogenic genes in brown adipocytes, it also regulates mitochondria 
biogenesis and oxidative metabolism in many cell types and organs”. 
By contrast, PRDM16 seems to be a more specific driver of thermo- 
genic gene expression. PRDM 16 is a co-activator that forms a complex 
with multiple adipogenic transcription factors, including PPAR-y and 
PGC-1a°°***”, This complex may drive commitment to a brown adi- 
pocyte fate (rather than a muscular cell fate) during brown adipogen- 
esis, but might be more important for beige adipocyte recruitment in 
WAT’*“**, Whereas adipose-tissue-specific deletion of PRDM16 blocks 
the induction of thermogenic genes in WAT in response to adrenergic 
stimulation, paradoxically it has almost no effect on BAT development”. 

It is also important to understand how specific transcriptional driv- 
ers such as PRDM 16 and PPAR-y integrate with other mechanisms that 
regulate cell-fate decisions. For example, epigenetic factors can influ- 
ence the availability of the promoters targeted by these fate-determining 
transcription factors. In brown adipocyte precursors, the lysine methyl- 
transferase EHMT1 drives brown adipogenesis by stabilizing PRDM16 
and regulating histone methylation at muscle-selective gene promot- 
ers’. Furthermore, transcription factors can be covalently modified to 
regulate their activity. This mechanism is exemplified by the deacetylase 
SIRT 1, which targets multiple transcription factors such as PGC-1a and 
FOXO1 to regulate oxidative metabolism and glucose homeostasis”, 
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In addition, SIRT 1-mediated deacetylation of PPAR-y favours beige adi- 
pocyte recruitment in WAT, and SIRT 1-mediated deacetylation of the 
myogenic transcription factor MyoD might repress the myogenic gene 
programme during brown adipogenesis'”*'°***. 

Finally, miRNAs have recently emerged as regulators of cell differentia- 
tion and metabolism in multiple tissues, including BAT and WAT”?*?, 
For example, in brown adipocyte precursors miR-155 is involved in a 
feedback loop with C/EBP-f that regulates the decision to proliferate or 
differentiate”. Multiple miRNAs also fine-tune the fate of differentiating 
WAT and BAT adipocyte precursors. miR-27 is a negative regulator of the 
brown and beige adipogenic programme in BAT and WAT that is down- 
regulated by cold exposure, whereas miR-196a expression is increased 
in WAT precursor adipocytes by cold exposure or B-adrenergic stimu- 
lation to induce the recruitment of beige cells°”**, In addition, miRNAs 
coordinate with PRDM16 to regulate the cell-fate decision to become 
a brown adipocyte rather than a myocyte. In this context, PRDM16 
drives brown adipogenesis by inducing the expression of the miRNA 
cluster miR-193b-365, which targets the brown adipogenic repressor 
Runx1tl (ref. 59). Conversely, miR-133 enforces myogenic commitment 
by repressing PRDM 16 expression, and is downregulated in BAT and 
WAT during brown and beige adipogenesis to allow increased PRDM16 
expression”**, 


Regulation of adipocyte function in rodents 

Therapeutic strategies that increase the number of brown and beige adi- 
pocytes by driving adipogenesis towards a thermogenic cell fate might not 
elicit metabolic benefits unless these thermogenic cells are appropriately 
activated. Although white, brown and beige adipocytes have divergent 
roles in terms of energy homeostasis, they are all functionally coordinated 
by integrating sympathetic nervous and endocrine control. 


The role of the sympathetic nervous system 
In times of nutrient excess, energy uptake by white adipocytes is 
principally regulated by insulin’s anabolic role, mediating glucose 
and lipid uptake, lipogenesis and inhibition of lipolysis”. By contrast, 
the sympathetic nervous system (SNS) regulates responses to energy 
deficits caused by energy shortages (for example, fasting) or increased 
energy expenditure (for example, cold exposure)”. The physiologi- 
cal response to cold exposure requires an SNS-coordinated crosstalk 
between white, brown and beige adipocytes with respect to nutrient 
release, partitioning and utilization (reviewed in refs 9, 64; Fig. 3). 
For example, increased sympathetic tone to WAT triggers lipolysis 
so that intracellular lipid stores are released to provide nutrients for 
other organs, particularly for activated BAT™. Increased sympathetic 
tone to BAT also triggers lipolysis, providing free fatty acids to locally 
activate UCP1 and to serve as substrates for oxidation”. Importantly, 
the substrate uptake machinery is also upregulated, allowing lipids 
released by WAT to be taken up by BAT”. Sustained SNS activation 
will also result in increased thermogenic capacity due to increased 
thermogenic gene expression and, in cases of chronic activation, due 
to proliferation and differentiation of brown adipocyte precursors’. 
Increased SNS tone to WAT also recruits and activates beige adipo- 
cytes, which might contribute to increased total thermogenic capac- 
ity’*’”. The functional importance of beige adipocytes is a burning 
question that has yet to be compellingly clarified in rodents. This 
could be because it is difficult to experimentally distinguish the ther- 
mogenic contribution of beige adipocytes from that of brown adipo- 
cytes, and because the many factors — including cold exposure — that 
recruit and activate beige adipocytes also recruit and activate brown 
adipocytes. In support of a physiological role for beige adipocytes in 
energy balance, murine-strain-dependent differences in resistance 
to diet-induced obesity have been attributed to the inducibility of 
beige-adipocyte recruitment in WAT™. In addition, increased recruit- 
ment of beige adipocytes has been shown to compensate for decreased 
BAT thermogenesis, and the specific loss of beige adipocytes due to 
adipose-tissue-specific PRDM16 depletion causes obesity”. 
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In any case, because activating thermogenesis could be of therapeu- 
tic interest, strategies to optimize SNS-targeted activation might be 
valuable. However, pharmacological strategies that indiscriminately 
increase sympathetic signalling result in cardiovascular side effects 
that are unacceptable when treating obesity”. The tissue-selective 
distribution of specific B-adrenergic receptor (B-AR) isoforms sug- 
gested it was feasible to target adipose tissue specifically through the 
B3-AR isoform”™. However, whereas in rodents B3-AR agonists mimic 
the effects of the cold on BAT and WAT, a human {3-AR agonist that 
is both potent and selective remains elusive. This emphasizes the 
therapeutic relevance of mechanisms that could increase adrenergic 
responsiveness of thermogenic adipocytes, and of alternative, non- 
adrenergic pathways that can activate BAT. 


Alternative pathways that activate thermogenesis 

It is possible to directly manipulate the adrenergic signalling path- 
way to sensitize adipocytes to adrenergic stimulation. For example, 
the transcription factor FOXC2 sensitizes adipocytes to adrenergic 
stimulation by modulating the expression and activity of adrenergic 
signalling molecules” ”’. In addition, BMP8B is produced by brown 
adipocytes in response to adrenergic stimulation, and increases the 
maximal thermogenic response to adrenergic stimulation”. BMP8B 
induces the phosphorylation of downstream adrenergic targets such 
as hormone-sensitive lipase and p38 MAPK (Fig. 3) by signalling 
through Smad1, 5 and 8 (ref. 72). This pathway acts in parallel to and 
synergistically with cAMP and protein kinase A (PKA)-dependent 
adrenergic signalling”. BMP7 can also drive brown adipogenesis and 
recruit beige adipocytes by activating p38 MAPK””* 

Moreover, several alternative mechanisms that activate brown and 
beige adipocytes independently of the SNS have been discovered”. 
For example, stimulated brown adipocytes are a significant source of 
FGF21, a previously recognized liver product that is known to act on 
the liver and WAT to regulate glucose homeostasis”. FGF21 released 
by brown adipocytes acts locally to further increase BAT thermogen- 
esis, but also enters the circulation and acts systemically to induce 
beige adipocyte recruitment in WAT by stabilizing PGC-1a post- 
translationally”*””. In this way FGF21 can coordinate thermogenesis in 
both BAT and WAT’*”. In addition, the adrenergic target PKA shares 
many intracellular targets with cGMP-dependent protein kinase G 
(PKG). As a result, cardiac natriuretic peptides (CNPs), which act 
through the cGMP-PKG pathway, can induce lipolysis and the expres- 
sion of thermogenic genes in brown and white adipocytes, a response 
which is additive to that of noradrenaline”. Therefore, like BMP8B 
signalling, CNP-signalling pathways could be manipulated to boost 
the responsiveness of brown and beige adipocytes to pre-existing sym- 
pathetic tone, or even bypass adrenergic mechanisms altogether to 
recruit beige adipocytes. 

In line with this strategy, the hormone irisin and the cytokine cardio- 
trophin-1 preferentially elicit a thermogenic response in WAT****’. Car- 
diotrophin-1 is secreted by the heart and binds glycoprotein 130 (gp130), 
acytokine receptor that classically activates the JAK-STAT pathway”. 
Irisin is produced by white adipocytes and muscle, and acts preferen- 
tially on white adipocyte precursors to recruit beige cells by activating the 
adrenergic target p38 MAPK as well as ERK1/2 (refs 18, 81, 83, 84). The 
fact that inhibiting ERK signalling reduces the ability of irisin to increase 
UCP! expression highlights the importance of this parallel kinase cascade 
in the thermogenic response™. 

Finally, the regulation of mitochondrial dynamics is emerging as an 
important pathway that controls thermogenesis. Adrenergic stimu- 
lation of brown adipocytes was recently found to promote uncou- 
pled respiration by inducing mitochondrial fission*. FOXC2 also 
increases thermogenesis, partly by increasing the expression of the 
mitochondrial transcription factor mtTFA (also known as Tfam), 
thereby driving mitochondrial biogenesis”’. Overall, there are sev- 
eral ‘alternative’ pathways that regulate BAT activity, and that could 
be explored therapeutically. 
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Figure 3 | Regulation of lipolysis and thermogenesis in adipocytes 

by the sympathetic nervous system. The sympathetic nervous system 
triggers intracellular signalling events that lead to lipolysis (black arrows). 
Noradrenaline (NA) released by sympathetic nerve endings binds to 
B-adrenoreceptors (§-ARs), which couple to Ga G-proteins that activate 
adenylate cyclase (AC), increasing the cellular concentration of cAMP. This 
activates protein kinase A (PKA), which phosphorylates lipid droplet proteins 
including hormone-sensitive lipase (HSL) and perilipin, allowing activation 
of adipose triglyceride lipase (ATGL). The net result is the hydrolysis of 
triglycerides stored in the lipid droplet to free fatty acids (FFAs), which are 
released. This response can be attenuated (red arrows) by the activation of 
phosphodiesterases (PDE), which hydrolyse cAMP. In the case of recruitment 
and activation of beige adipocytes and the activation of brown adipocytes, this 
signalling pathway (blue arrows) also leads to increased expression of UCP1 
and other thermogenic genes through the p38 MAPK cascade. FFAs released 
by lipolysis activate existing UCP1 in the mitochondria, and their oxidation 
produces heat. 


Looking beyond brown and beige adipocytes 

The discovery that macrophages within BAT are an alternative source 
of noradrenaline highlights the fact that BAT consists of more than just 
brown adipocytes*®. Given the dependence of optimal BAT activation on 
innervation and vascularization for gas exchange and heat dissipation, 
other cell types such as endothelial cells and nerve cells seem to be impor- 
tant contributors to overall BAT function’. Therefore, the remodelling 
of vascular and nervous networks in adipose tissue could enhance the 
therapeutic benefits of brown and beige adipocyte recruitment and acti- 
vation. As well as recruiting thermogenic adipocytes, cold exposure also 
induces angiogenesis and increases the density of sympathetic neurons 
in adipose tissue'**”**. Furthermore, transgenic rodent models have also 
demonstrated that thermogenesis and angiogenesis are coupled. Adi- 
pose-tissue-specific transgenic expression of FOXC2 in mice results in 
WAT browning and increased vascularization in WAT due to increased 
expression of the angiogenic factor angiopoietin 2 (refs 9, 89). In addi- 
tion, adipose-tissue-specific transgenic expression of VEGF induces 
angiogenesis, but also increases energy expenditure and induces beige- 
cell recruitment in WAT”. 

Up to this point, we have predominantly focused on thermogenesis- 
based strategies that could ameliorate obesity and metabolic lipotoxic- 
ity by increasing energy expenditure and disposing of nutrient excess. 
However, an alternative anti-lipotoxic strategy would be to increase the 
capacity of WAT for nutrient storage by improving WAT function and 
enabling WAT expansion. Just as angiogenesis seems to playa key part in 
BAT expansion and in WAT browning, this process is also important in 
WAT expansion, in the context of a positive energy balance. In line with 
this, hypoxia caused by inadequate vascularization of enlarged hyper- 
trophic adipose-tissue depots is one mechanism that drives WAT dys- 
function in obesity”. Additional mechanisms include altered cytokine 
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release leading to insulin resistance, immune cell infiltration, inflamma- 
tion and fibrosis, and are reviewed elsewhere”. 

Given the deleterious effects of hypoxia on adipocyte function, it is 
intuitive to suggest that improving the vascularization of WAT could 
improve WAT function. Experimentally, it seems that either promot- 
ing’ or preventing”’”*** angiogenesis in adipose tissue can improve 
the metabolic health of diet- and genetically induced obese mice. One 
hypothesis to explain this apparent contradiction is that blocking angi- 
ogenesis in a state of pre-existing obesity improves WAT function by 
inducing preferential apoptosis of hypoxic, hypertrophic adipocytes so 
that only smaller, healthier adipocytes remain, whereas promoting angi- 
ogenesis in the early stages of obesity enables healthy WAT expansion”. 

In line with this, chronic treatment of obese rats with PPAR-y agonists 
leads to increased weight-gain due to the accumulation of fat mass”. 
Paradoxically, this is associated with increased insulin sensitivity”. 
WAT expansion can also be achieved by modulating the action of the adi- 
pokine adiponectin, by elevating its circulating levels or by adiponectin 
receptor agonism™*"”. In diet- and genetically induced obese mice, such 
treatments further increase body weight and subcutaneous WAT mass, 
but also drastically improve carbohydrate and lipid metabolism”. 
These metabolic benefits are associated with improved WAT function, 
characterized by decreased adipocyte size (although increased number) 
and decreased WAT inflammation” ’”. In an extreme example of facili- 
tating WAT expansion, ob/ob mice with adipose-tissue-specific trans- 
genic expression of the mitochondrial protein mitoNEET gain almost 
twice as much weight as non-transgenic ob/ob mice!”'. Nevertheless, 
these mice have smaller adipocytes in subcutaneous WAT and main- 
tain metabolic parameters comparable with those of wild-type mice™”. 
Although reducing adipocyte size is associated with metabolic improve- 
ments in obesity, allowing adipocytes to expand freely by knocking out 
the gene encoding the extracellular matrix component collagen VI also 
reduces inflammation in WAT and improves whole-body glucose and 
lipid metabolism". Taken together, these mouse models demonstrate 
that enabling WAT expansion to buffer nutrient excess can ameliorate 
the lipotoxic metabolic complications of obesity. 


The relevance of BAT in humans 

Given that increased fat mass is central to obesity, it is intuitive to 
target WAT to develop therapeutic strategies for the treatment of 
metabolic syndrome. Although the use of rodent cellular and in vivo 


models has greatly informed our current understanding of BAT func- 
tion and highlighted BAT as a potential anti-obesity target, investiga- 
tions in human BAT are significantly less advanced. 


Molecular characterization of human BAT 

As discussed previously, rodents possess two distinct classes of thermo- 
genic adipocytes. Brown and beige adipocytes have distinct developmen- 
tal origins and gene expression patterns and are not necessarily optimally 
stimulated by the same activating molecules. Therefore, determining 
which types of thermogenic adipocyte are found in human BAT could 
have profound implications for the successful development of species- 
specific potent activators. UCP1* adipocytes in adult human supracla- 
vicular BAT occur in clusters interspersed among white adipocytes, a 
pattern that is reminiscent of the beige cells recruited by cold exposure 
within the subcutaneous WAT depot in rodents’ *. Ona molecular level, 
the expression of key thermogenic genes such as UCP1, PGC1A (also 
known as PPARGCIA) and PRDM 16 in this and other human BAT depots 
confirms the thermogenic potential of this tissue, but cannot discriminate 
between a brown-like or beige-like phenotype? *87"'°", 

Thorough molecular comparisons of rodent brown, beige and white 
adipocytes have identified several candidate discriminatory markers at 
the messenger RNA level (Supplementary Table 1; Fig. 4)'*7°*!6"°°. The 
expression levels of these genes have been measured in multiple human 
BAT depots in adults and infants'***'* "°°, Some rodent brown markers 
are expressed in human WAT and therefore might not be discriminatory 
for human thermogenic adipocytes’”’. Nevertheless, overall these results 
indicate that human BAT depots contain both brown-like and beige-like 
adipocytes (Supplementary Table 2; Fig. 4)'*’°'"'*. For example, the 
infant interscapular BAT (iBAT) depot is dominated by multilocular 
UCP1* adipocytes and separated from surrounding WAT by connective 
tissue, a structural arrangement like that of the rodent iBAT depot. 
Molecular characterization of infant iBAT indicates that this depot has 
higher expression levels of the rodent brown-adipocyte marker ZIC1 and 
lower expression levels of the rodent beige-adipocyte marker TBX1 com- 
pared with adult human supraclavicular BAT, suggesting a truly brown 
identity’™. In comparison with other adipose-tissue depots, adult haman 
supraclavicular BAT is particularly enriched with markers of rodent 
beige adipocytes'*”"" (Fig. 4). However, evidence suggests that adult 
humans do possess a population of bona fide brown adipocytes in human 
deep neck fat’. In deep neck adipose tissue, UCP! expression increases 
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Figure 4 | Assessment of rodent brown and beige adipocyte markers in 
human adipose tissue. a, Well-established rodent brown, beige and white 
markers. Both white and beige markers are found in subcutaneous white 
adipose tissue (WAT) depots, but beige markers are upregulated after cold 
exposure. b, Locations of the brown adipose tissue (BAT) and WAT depots 
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in which the reliability of these rodent markers has been assessed in infants 
and adults. iBAT, interscapular BAT; supBAT, supraclavicular BAT; scWAT, 
subcutaneous WAT. c, A comparison of the rodent brown, beige and white 
markers that have been found to be enriched in particular human adipose 
tissue depots. See Supplementary Tables 1 and 2 for more detailed information. 
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steeply towards the innermost layers accompanied by elevated expression 
of the rodent brown-adipocyte markers ZIC1 and LHX8 only in the deep- 
est locations”. By contrast, layers at intermediate depth with moderate 


UCPI expression are enriched with beige markers”. 


Physiological importance of human BAT 

Efforts to characterize human BAT ona molecular level have been accom- 
panied by studies investigating the role of BAT in energy balance and 
metabolism. Estimates of BAT mass and activity from positron emis- 
sion tomography with '*F-fluoro-2-deoxy-p-glucose integrated with 
computed tomography ('*FDG-PET-CT) suggest that humans have, 
on average, 50-80 g of BAT”. This figure divides opinions on whether 
endogenous BAT activity could be relevant to human energy balance 
and metabolism. However, '*FDG-PET-CT does not directly measure 
either oxidative metabolism or thermogenesis in BAT as it only reports 
the uptake of the minor oxidative substrate glucose. At room temperature, 
®EDG-PET-CT usually detects BAT in less than 10% of adult humans, 
but the detection rate increases following mild cold exposure before scan- 
ning®*. Therefore, it is possible that 1SEDG-PET-CT studies underesti- 
mate the prevalence and volume of BAT in the population, particularly 
as some subjects do have UCP1* adipocytes in their supraclavicular BAT 
despite not accumulating “FDG'™. Furthermore, this technique cannot 
distinguish clusters of thermogenic adipocytes in WAT’. 

Despite the limitations of 'SEDG-PET-CT, BAT activity does seem to 
have a positive relationship with resting metabolic rate (RMR) in healthy 
adult males, although only at cooler temperatures (22 °C compared with 
27°C)*", supporting a role for BAT in whole-body energy expenditure. 
In addition, multiple studies using this technique report that the likeli- 
hood of BAT detection and BAT activity decrease with increasing body 
mass index, per cent body fat, age and plasma glucose levels®*""*""*. In 
summary, it would seem that BAT activity is decreased in the metaboli- 
cally ‘unhealthy’ states of obesity and hyperglycaemia, whereas increased 
BAT activity is associated with a lean, normoglycaemic, ‘healthy’ meta- 
bolic profile. However, in these associations it is unclear which is the cause 
and which is the consequence. 


Activation and recruitment of human BAT 

The short-term, mild-cold-exposure protocol (exposure to temperatures 
between 15-18 °C for up to 6 hours) used by many human BAT studies 
has been shown to increase the skin temperature over the supraclavicular 
BAT depot, and to increase glucose uptake, fatty-acid uptake, oxidative 
metabolism and blood flow in this anatomical area’””’”"*. These changes 
are consistent with the activation of human BAT. Furthermore, two recent 
studies investigated whether chronic cold exposure is able to induce BAT 
recruitment by measuring BAT activity in response to a single episode of 
cold exposure before and after cold acclimation’*""*. Both studies showed 
that a daily regime of cold exposure — either 2 hours at 17°C daily for 
6 weeks or up to 6 hours at 15-16 °C daily for 10 days — increased glucose 
uptake into BAT'’*""’, and one study even reported a 37% increase in 
estimated BAT volume’. Taken together, this is promising evidence that 
human BAT is an active, highly plastic ‘trainable’ tissue that is responsive 
to physiological stimuli. 


Cellular models of human adipose tissue 

Just as murine cellular models have proved invaluable to the investigation 
of adipocyte differentiation and functional regulation, human adipocyte 
cellular models will be key to translating therapeutic strategies to humans. 
First, in vitro culture experiments have succeeded in demonstrating that 
human BAT is more metabolically active than WAT. The oxygen con- 
sumption rates of adipocytes from human supraclavicular BAT and deep 
neck fat biopsies were found to be higher compared with adipocytes from 
human subcutaneous WAT biopsies’ "7 Tn addition, human cervical 
fat adipocyte precursors have been differentiated in vitro into beige-like 
cells using a brown adipogenic cocktail’'®. These beige-like cells demon- 
strated higher basal oxygen consumption rates and increased uncoupling 
compared with white adipocytes derived from the same precursors using 
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a white adipogenic cocktail''*. Furthermore, the oxygen consumption 
rate of the beige-like cells could be further increased on stimulation with 
noradrenaline’*. 

Second, in vitro models have already been used to demonstrate that 
alternative activators of rodent thermogenic adipocytes could be relevant 
to humans. For example, CNPs induce lipolysis and thermogenic gene 
expression in human white adipocytes”””. In addition, UCP1 expression 
and uncoupled respiration can be increased in white adipocytes derived 
from human subcutaneous WAT and neck fat by treatment with FGF21 
or irisin’’*"”°. These results suggest that these molecules are physiologi- 
cal thermogenic activators in humans, which is also supported by recent 
evidence that circulating levels of FGF21 and irisin are increased with 
cold exposure in humans’), 

Although primary culture and differentiation of human adipocyte 
precursors is the gold standard for interrogating adipogenic regulatory 
networks and testing agents that might modulate adipocyte function, 
these cells are difficult to obtain and have a limited lifetime in culture. 
Recent advances in stem-cell technology have allowed the field to move 
towards human embryonic stem cell and induced pluripotent stem (iPS) 
cell differentiation models, which can provide an unlimited source of 
human white, brown and beige adipocytes, and their intermediate pre- 
cursors'*~’”*, Such genetically engineerable models are valuable tools 
for building a comprehensive picture of the regulatory networks that 
determine brown-, beige- and white-adipocyte fate in humans. Per- 
haps more importantly, they will enable high-throughput screening of 
candidate agents for increasing BAT mass and/or activity, and for beige 
adipocyte recruitment. Furthermore, adipocytes derived from clinical 
grade iPS cells might even be reintroduced into the body. The use of 
BAT transplantation as a method to increase BAT mass has already been 
tested in rodent models, and could be viable given adequate innerva- 
tion and vascularization of the transplanted material!**!*°, Therefore, 
cellular models of human adipocytes are valuable tools that might help 
to bridge the gap between rodents and humans, from drug discovery 
to transplantation. 


Future perspectives 

We propose that the metabolic toxicity that drives the pathology of obesity 
and metabolic syndrome could be reduced by improving the nutrient 
storage capacity of WAT and/or by maximizing the energy-dissipating 
potential of thermogenic brown and beige adipocytes. In our opinion, 
progress in using BAT to treat metabolic disease requires a better under- 
standing of the molecular nature of the adipocytes in human BAT, which 
will provide valuable guidance for the translation of strategies to activate 
brown or beige cells in rodent models into a clinical context. In particu- 
lar, the development of appropriate human cellular models to enable the 
screening of potential pharmacological activators is an important first step 
on the predictably long and winding road to clinical success. = 
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The role of hepatic lipids in hepatic 
insulin resistance and type 2 diabetes 


Rachel J. Perry’, Varman T. Samuel”, Kitt F. Petersen’’ & Gerald I. Shulman’*** 


Non-alcoholic fatty liver disease and its downstream sequelae, hepatic insulin resistance and type 2 diabetes, are rapidly 
growing epidemics, which lead to increased morbidity and mortality rates, and soaring health-care costs. Developing 
interventions requires a comprehensive understanding of the mechanisms by which excess hepatic lipid develops and 
causes hepatic insulin resistance and type 2 diabetes. Proposed mechanisms implicate various lipid species, inflammatory 
signalling and other cellular modifications. Studies in mice and humans have elucidated a key role for hepatic diacylglycerol 
activation of protein kinase Ce in triggering hepatic insulin resistance. Therapeutic approaches based on this mechanism 
could alleviate the related epidemics of non-alcoholic fatty liver disease and type 2 diabetes. 


odern global health care faces challenges that are drastically 
M different from past generations, largely owing to the increas- 

ing worldwide prevalence of obesity. This is exemplified by a 
change in focus to centre on obesity-related liver disease. Although viral 
hepatitis continues to be an important health concern, non-alcoholic fatty 
liver disease (NAFLD) is the now most common liver disorder in the 
Western world, where the rates of adult and paediatric obesity have soared 
to an estimated 20-30% of the US population’. In east and south Asian 
communities, NAFLD is also on the rise, with estimates that its prevalence 
reaches as high as 60% in urban areas”. Startlingly, NAFLD has been 
found to be highly prevalent among young lean south Asian Indians”®. 

A strong association between NAFLD and type 2 diabetes has been 
demonstrated: more than 90% of obese patients with type 2 diabetes have 
NAFLD’. Insulin resistance is common in both conditions’. Patients 
with NAFLD almost universally have hepatic insulin resistance, which 
increases the risk of impaired fasting glucose and type 2 diabetes**"'. In 
addition, a subset of patients with NAFLD will develop non-alcoholic 
steatohepatitis (NASH) with histological changes such as steatosis, lobu- 
lar inflammation and/or hepatocellular ballooning’. Around 20% of 
patients with NASH will progress to liver cirrhosis and liver failure’*™*. 
NASH-associated cirrhosis is now the third most common indication 
for liver transplantation in the United States'*. Health policies that can 
prevent NAFLD and new treatments that can reverse the disease will offer 
tremendous benefits, in terms of both lives saved and health-care costs. 

Thus, in this Perspective we will discuss the link between hepatic lipid 
accumulation and hepatic insulin resistance and focus on the role of 
diacylglycerol, a lipid metabolite that activates novel protein kinase C iso- 
forms (PKCs) and thereby impairs insulin signalling, in the pathogenesis 
of lipid-induced hepatic insulin resistance. Although several other mecha- 
nisms have been proposed to explain this association, these alternatives 
have been reviewed elsewhere’. As we will discuss here, diacylglycerol- 
induced novel PKC activation has emerged as a common mechanism to 
explain the development of insulin resistance in liver and skeletal muscle 
in a variety of experimental and clinical models. 


Molecular mechanism of lipid-induced insulin resistance 
Insulin action requires a coordinated, intricate relay of intracellular sig- 
nals, involving mostly phosphorylation and dephosphorylation events. 


In the canonical view of hepatic insulin signalling, insulin binds and 
activates the insulin receptor tyrosine kinase (IRTK), which in turn pro- 
motes tyrosine kinase phosphorylation of insulin receptor substrates 
(IRS), most importantly IRS2 in the liver (Fig. 1)”. Phosphorylation 
of IRS2 generates binding sites for Src homology 2 domain proteins, 
including phosphatidylinositol-3-OH kinase (PI(3)K)'*. The binding 
of PI(3)K to IRS2 recruits phosphatidylinositol-3,4,5-trisphosphate 
(PtdIns(3,4,5)P,), which in turn recruits Akt’’. Under insulin-stimulated 
conditions, 3-phosphoinositide-dependent kinase-1 phosphorylates and 
activates Akt, which is thought to suppress hepatic glucose production 
through two key mechanisms: first, decreased expression of gluconeo- 
genic enzymes by phosphorylation and nuclear exclusion of the fork- 
head box protein FOXO1 and its pro-gluconeogenic targets, and second, 
activation of glycogen synthase by phosphorylation and inactivation of 
glycogen synthase kinase-3f. Although this relatively linear construct 
is useful for interrogating insulin signalling in experimental models, it 
fails to capture the interwoven mechanisms that have evolved to regulate 
hepatic glucose and lipid metabolism. For example, although acute insulin 
signalling following a meal can decrease messenger RNA expression of 
gluconeogenic enzymes, it probably does not acutely alter the protein 
levels of these enzymes. Gluconeogenic enzymes are also conventionally 
thought to be subject to allosteric activation: acetyl coenzyme A (acetyl- 
CoA) activates pyruvate carboxylase”, and fructose-2,6-bisphosphate 
inhibits fructose-1 ,6-bisphosphatase™. And, although insulin might acti- 
vate glycogen synthesis, glucose is necessary to inhibit glycogen phos- 
phorylase and effectively promote net hepatic glycogen synthesis”. 
The development of NAFLD is strongly associated with hepatic insulin 
resistance. This relationship is most apparent when NAFLD is induced 
in rats after just 3 days of being fed a high-fat diet. The ability of insulin 
to suppress hepatic glucose production is diminished in this model even 
without changes in body weight, adiposity or muscle insulin resistance”. 
Hepatic insulin resistance in this model was associated with increased 
hepatic diacylglycerol content and increased translocation of the primary 
novel PKC isoform in liver, protein kinase-Ce (PKCe)**”, to the plasma 
membrane at which it was found to bind and inhibit the activity of the 
intracellular kinase domain of the insulin receptor. This was associated 
with reduced insulin-stimulated phosphorylation of IRS2 and IRS2- 
associated PI(3)K activity and phosphorylation of Akt2. Consequently, 
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the ability of insulin to activate glycogen synthesis and inhibit gluconeo- 
genesis was impaired (Fig. 1)”°. The crucial role of PKCe in mediating 
lipid-induced hepatic insulin resistance has been convincingly demon- 
strated by knocking down expression of PKC¢e in the liver of rodents. 
Rats treated with an antisense oligonucleotide (ASO) to decrease hepatic 
expression of PKCe were protected from lipid-induced hepatic insulin 
resistance, although hepatic diacylglycerol and triglyceride content were 
unchanged”®. Furthermore, these animals were found to have preserved 
IRTK activity with intact signalling through downstream proteins. Simi- 
larly, Prkce whole-body knockout mice are also protected from lipid- 
induced insulin resistance”. 

This model for lipid-induced hepatic insulin resistance has been trans- 
lated to humans. Potential mechanisms for hepatic insulin resistance were 
assessed in a group of patients undergoing bariatric surgery. Although 
the participants were all obese, there was a significant variation in insulin 
resistance. Notably, individuals with very similar body mass index could 
manifest markedly different degrees of insulin resistance. By contrast, 
hepatic diacylglycerol content and PKCe activation were the strongest 
predictors of hepatic insulin resistance in liver biopsies obtained from 
these individuals”. There was no association between insulin sensitiv- 
ity and other factors implicated in causing hepatic insulin resistance, 
including ceramide content, endoplasmic reticulum (ER) stress markers 
or inflammatory cytokine concentrations. These results were replicated 
in another study showing that hepatic diacylglycerol content was the 
best predictor of hepatic insulin resistance in obese humans, whereas 
there was no association with hepatic ceramide content or markers of 
inflammation”. Indeed, hepatic inflammation has been suggested to be 
aconsequence, not a cause, of insulin resistance”. Thus, although excess 
calorie intake certainly leads to obesity, only those who develop hepatic 
steatosis will develop insulin resistance. These data argue that the key step 
in the pathogenesis of hepatic insulin resistance is the accumulation of 
hepatic diacylglycerol leading to activation of PKCe. Experimental models 
of altered hepatic lipid content allow us to further test this mechanism and 
could also inform the development of potential therapeutic interventions. 

The model outlined focuses on the inability of insulin to alter hepatic 
glucose metabolism. Despite this, the ability of insulin to activate lipo- 
genesis seems to be intact in most models of NAFLD. Much has been 
written about the paradox of selective insulin resistance, with investiga- 
tors proposing the existence of branch points in the insulin signalling 
pathway”, but space limitations preclude discussion of selective insulin 
resistance in this Perspective. Instead, we will discuss the mechanisms that 
govern hepatic lipid accumulation and its relationship to insulin’s ability 
to control hepatic glucose metabolism. 


Regulation of fat delivery to liver 
Hepatic lipid content is regulated by the balance between hepatic lipid 
uptake, synthesis, oxidation and export (Fig. 2). Hepatic lipid uptake is 
a function of substrate delivery and transport into the hepatocyte, and 
several genetic models exemplify this aspect of hepatic lipid metabolism. 
Transgenic mice with liver-specific overexpression of lipoprotein lipase 
(LpL) develop liver-specific lipid accumulation and liver-specific insulin 
resistance, whereas transgenic mice with muscle-specific overexpression 
of LpL develop muscle-specific lipid accumulation and muscle-specific 
insulin resistance™. In these models, tissue-specific insulin resistance fol- 
lowed ectopic lipid accumulation. Ina similar example, mice that lack the 
primary fatty-acid transporter in hepatocytes, FATPS, are protected from 
diet-induced NAFLD, indicating that excess fatty-acid transport into the 
hepatocyte is required for NAFLD and hepatic insulin resistance”. 
Studies in mice and humans have implicated adipose tissue lipolysis 
as an important source of fatty acids that promote NAFLD and hepatic 
insulin resistance. Whole-body lipolysis increases with total fat mass 
in humans***’; however, the relationship between lipolysis and insulin 
sensitivity seems to be largely independent of body mass. For example, 
insulin-resistant obese adolescents have higher visceral fat content than 
their weight-matched, insulin-sensitive counterparts”. The relationship 
between adipose lipolysis and hepatic lipid content is exemplified by 
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Figure 1 | Molecular mechanism by which excess diacylglycerol leads 

to hepatic insulin resistance and hyperglycaemia. Increases in liver 
diacylglycerol (DAG) cause protein kinase Ce (PKCe) activation and 
translocation to the cell membrane, which results in inhibition of insulin 
signalling. Reduced phosphorylation of insulin receptor substrate-2 (IRS2) 
and PI(3)K impairs Akt2 activity by reductions in 3-phosphoinositide- 
dependent protein kinase 1 (PDK1) activity, suppressing glycogen synthase 
kinase-3 (GSK3) phosphorylation and reducing insulin-stimulated liver 
glycogen synthesis through reduced glycogen synthase (GS) activity. Impaired 
Akt2 activity also reduces insulin suppression of hepatic gluconeogenesis 
by promoting Forkhead box protein O1 (FOXO1) translocation to the 
nucleus due to reduced phosphorylation and increasing expression of the 
gluconeogenic proteins pyruvate carboxylase (PC), phosphoenolpyruvate 
carboxykinase (PEPCK), glucose-6-phosphatase (G6Pase). PIP3, 
phosphatidylinositol (3,4,5)-triphosphate. 


manipulation of the genes that regulate adipose lipolysis. As in humans, 
the effect of lipolysis on insulin sensitivity in rodents seems to be inde- 
pendent of body weight. Obese mice lacking the fatty-acid-binding 
protein FABP in adipocytes are more insulin sensitive than their obese 
littermates with normal FABP”. Conversely, leptin-deficient obesity- 
prone mice with increased rates of adipocyte lipolysis due to knockout of 
the gene encoding adipocyte phospholipase A2 show increases in ectopic 
lipid storage and insulin resistance despite reduced body weight compared 
with littermates with normal lipolytic rates”’. These data suggest a facilita- 
tive role for the increases in adipose tissue lipolysis in providing substrates 
for ectopic lipid deposition and insulin resistance. 

These genetic rodent models inform our understanding of human 
disease. Patients with conditions resulting from mutations in LpL (for 
example, hyperlipoproteinaemia type 1) are prone to developing insu- 
lin resistance*. In humans, the single nucleotide polymorphism (SNP) 
1856225452, putatively representing a gain-of-function mutation in the 
FATP5 promoter, was associated with insulin resistance and NAFLD”. 
In Asian Indian individuals, as well as those of other ethnic groups, vari- 
ants (C-482T, T-455C or both) in apolipoprotein C3 (APOC3), which 
can inhibit LpL and hepatic lipase, are associated with hypertriglyceri- 
daemia and NAFLD”. These polymorphisms led to around 30% higher 
plasma APOC3 concentrations and post-prandial hypertriglyceridaemia 
through the inhibitory effect of APOC3 on LpL activity. As a result, the 
livers of individuals with APOC3 variants take up a greater amount of 
lipid from chylomicrons, remnants of lipoprotein particles, predisposing 
these lean subjects to NAFLD and hepatic insulin resistance. These results 
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Figure 2 | NAFLD develops due to an imbalance between lipid supply 

and demand. Fatty acids (FAs) derived from lipolysis and from chylomicron 
remnants are taken up through fatty-acid transport proteins (FATPs), mainly 
FATP2 and FATP5 in the liver; chylomicron remnants are also taken up through 
the low-density lipoprotein (LDL) receptor. A small fraction of intracellular fatty 
acid supply in the liver also comes from de novo lipogenesis in the cytosol. Fatty 
acids can also be re-esterified to lysophosphatidic acid (LPA) by acyl-coenzyme 
A (AcCoA) and the conversion of glycerol 3-phosphate (G3P) by either 
mitochondrial glycerol-3-phosphate acyltransferase (mtGPAT) or microsomal 
GPAT (msGPAT). Fatty-acyl CoAs (shown here as phosphatidic acid, PA) 
formed by 1-acylglycerol-3-phosphate O-acyltransferase-2 (AGPAT2) are then 
added to the glycerol backbone by phosphatidic acid phosphatase (PAP) to 
generate diacylglycerol (DAG), and by diacylglycerol acyltransferases (DGAT) 
to generate triacylglycerol (TAG). Increased DAG causes protein kinase Ce 
(PKCe) translocation to the cell membrane, which inhibits insulin signalling. 
Lipids may also be sequestered in lipid droplets as monoacylglycerol (MAG), 
DAG and TAG, but these are not thought to be responsible for hepatic insulin 
resistance. By inhibition of adipose triglyceride lipase (ATGL), comparative 
gene identification-58 (CGI-58) bound to perilipin is mainly responsible for 
lipid sequestration in the droplet. By contrast, intracellular hepatic lipid content 
is reduced by two mechanisms: mitochondrial fatty acid oxidation and export 
from the smooth endoplasmic reticulum (SER) as very-low-density lipoprotein 
(VLDL). HSL, hormone-sensitive lipase; MTTP, microsomal triglyceride 
transfer protein. 


were replicated in another cohort oflean males of European descent“. Of 
note, this APOC3-gene-environment interaction has only been observed 
in lean males, probably reflecting a protective effect of oestradiol on the 
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ability of APOC3 to inhibit LpL and promote ectopic fat storage”, and 
the ability of obesity-associated NAFLD“ to mask the relatively subtle 
affect of this gene-environment interaction on the development of hepatic 
insulin resistance”. 

The effect of increased plasma Apoc3 concentrations on the devel- 
opment of NAFLD and hepatic insulin resistance has been genetically 
validated in transgenic mice that have increased hepatic overexpres- 
sion of Apoc3. These mice are more prone to diet-induced NAFLD 
and diacylglycerol-PKCe-induced hepatic insulin resistance than their 
wild-type littermates™. Interestingly, although hypertriglyceridaemia was 
present in transgenic mice fed both a normal and high-fat diet, severe 
hepatic steatosis and hepatic insulin resistance only developed in Apoc3 
transgenic mice fed a high-fat diet, reflecting an important gene-environ- 
ment interaction. Moreover, the phenotype was due to both inhibition of 
peripheral lipase activity and diminished hepatic triacylglycerol export 
as very-low-density lipoprotein (VLDL)*. 


Lessons learned from lipodystrophy 

The importance of adipose tissue lipid storage is exemplified when 
adipose tissue is altogether absent. In ‘fatless’ mice expressing the 
dominant-negative protein A-ZIP/F-1 in adipocytes, the absence of 
visceral and peripheral fat leads to ectopic lipid accumulation and 
severe hepatic and muscle insulin resistance. Insulin resistance can 
be corrected by transplantation of white adipose tissue from normal 
mice, further illustrating the importance of adipose tissue as a ‘safe’ 
storage depot”. Lipoatrophic mice with the gene encoding peroxisome 
proliferator-activated receptor-y (Ppar-y) knocked out in white adipose 
tissue or with the gene encoding hepatic 1-acylglycerol-3-phosphate 
O-acyltransferase 2 (Agpat2) knocked out globally display a similar 
phenotype: the loss of visceral and subcutaneous fat is associated with 
hepatic steatosis and hepatic insulin resistance”. 

Similar associations are evident in humans with lipodystrophies. 
Patients with these disorders represent a rare example of severe hepatic 
insulin associated with extreme hepatic steatosis in the absence of vis- 
ceral or peripheral fat accumulation” ™. Leptin treatment can decrease 
calorie intake and effectively normalize hepatic lipid content and hepatic 
insulin action®™. Similarly, patients with partial lipodystrophy owing to 
mutations in the scaffolding protein perilipin-1, which inhibits adipose 
triglyceride lipase, have reduced peripheral fat mass but develop NAFLD 
because of increased adipose tissue lipolysis resulting from inhibition 
of adipose tissue triglyceride lipase”’. These patients can also develop 
profound insulin resistance. These data again point to ectopic lipid accu- 
mulation in the liver, which might occur as a result of the diversion of 
substrates from other storage depots, as the crucial mediator of hepatic 
insulin resistance. 


Regulation of hepatic lipid synthesis 

Hepatic triacylglycerol synthesis is the sum of two main processes: the 
synthesis of fatty acids (de novo lipogenesis, DNL) and esterification of 
fatty acids into fatty-acid glyceride species (for example, mono-, di- and 
triacylglyceride). 


Contributions of de novo lipogenesis to triacylglycerol synthesis 

Although DNL is thought to make a relatively small contribution to 
hepatic triacylglycerol accumulation relative to esterification”, rates 
of postprandial DNL do increase significantly in both young and elderly 
patients with NAFLD*™. Diet-induced NALFD might stimulate a 
feed-forward loop exacerbating DNL and ectopic lipid deposition. For 
example, fructose-fed hamsters have hypertriglyceridaemia, NAFLD 
and insulin resistance associated with increased DNL", and because 
fructose inhibits fatty-acid oxidation both directly and indirectly, excess 
fructose intake is likely to stimulate DNL and hepatic insulin resist- 
ance”. However, reducing lipogenic gene expression by knockdown 
of the upstream gene encoding peroxisome proliferator-activated recep- 
tor y coactivator-1$ (PGC-18)™ protects against fructose-induced hepatic 
insulin resistance”. Similarly knockdown of the genes encoding the acetyl 
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CoA carboxylases ACC1 and ACC2, which are crucial in the regulation 
of DNL and lipid oxidation, respectively, reduced liver triacylglycerol and 
diacylglycerol content, decreased PKCe activation and protected mice 
from lipid-induced hepatic insulin resistance™. 


Skeletal muscle insulin resistance promotes hepatic lipogenesis 
Skeletal muscle insulin resistance typically accompanies insulin resistance 
at other sites, possibly because of the diversion of substrates from insulin- 
resistant muscle to the liver (Fig. 3). The independent effect of muscle 
insulin resistance to exacerbate NAFLD has also been demonstrated in 
rodents. Mice that lack insulin-responsive glucose transporter 4 (Glut4) 
in muscle have NAFLD”. Similarly, mice lacking the Akt substrate As160 
have decreased glucose uptake in adipose tissue and slow-twitch muscles, 
and develop insulin resistance in the liver as well as adipose tissue and 
skeletal muscle®. Muscle insulin resistance might also be independently 
associated with NAFLD. For example, the severity of NAFLD in mice 
fed a high-fat, high-cholesterol diet was demonstrated to correlate with 
peripheral insulin resistance, and not with hepatic insulin resistance”. 

These results have been translated to humans, in which selective muscle 
insulin resistance in healthy young lean individuals has been shown to 
predispose them to increased hepatic DNL, hepatic triacylglycerol accu- 
mulation and atherogenic dyslipidaemia after eating high-carbohydrate 
meals. This is because ingested glucose is diverted away from muscle 
glycogen storage to the liver, in which it is converted to triacylglycerol 
driven by the compensatory hyperinsulinaemia that is secondary to mus- 
cle insulin resistance”. Further evidence to support this hypothesis stems 
from a study demonstrating that a single 45 minute bout of exercise on an 
elliptical trainer increased postprandial muscle glycogen synthesis follow- 
ing carbohydrate ingestion, resulting in a 40% reduction in hepatic DNL 
and a 30% reduction in hepatic triglyceride synthesis”. 


Fatty acid esterification contributes to triacylglycerol synthesis 
Most liver triglyceride is formed through esterification of fatty acids”. 
Diacylglycerol is an intermediate in the esterification pathway and, thus, 
genetic models that manipulate hepatic lipid esterification can be used 
to further examine the diacylglycerol-novel-PKCs hypothesis of insulin 
resistance. Rats overexpressing the rate-controlling enzyme in triglycer- 
ide esterification, mitochondrial glycerol-3-phosphate acyltransferase 1 
(GPAT1), have hepatic insulin resistance associated with increased PKCe 
activity’. These data are in contrast to mice in which the gene encoding 
mitochondrial GPAT was knocked down. These animals exhibit suppres- 
sion of PKC activity and improved hepatic insulin sensitivity”, offering 
further evidence in support of the diacylglycerol-PKCe hypothesis of 
hepatic insulin resistance. 

Diacylglycerol acyltransferase 2 (DGAT2) catalyses the final step in 
triglyceride synthesis from diacylglycerol. Although inhibition of DGAT2 
may be expected to acutely increase cellular diacylglycerol content, 
chronic reduction in hepatic DGAT2 expression due to ASO treatment 
results in decreased hepatic diacylglycerol content due to downregula- 
tion of the lipogenic pathway. Consistent with the diacylglycerol-PKCe 
hypothesis this reduction in hepatic diacylglycerol content was associated 
with reduced PKC¢ activation and protection from lipid-mediated hepatic 
insulin resistance’”*”*. 

The effects of hepatic overexpression of DGAT2 are less clear. Monetti 
et al. reported that although transgenic mice had increased hepatic dia- 
cylglycerol content, there was no impact on hepatic insulin sensitivity”. 
Jornayvaz et al. also demonstrated an increase in hepatic diacylglycerol 
content in the same mice, but reported increased PKCe activation, 
decreased hepatic insulin signalling and hepatic insulin resistance”. 
Technical differences in study execution could explain this difference. In 
hyperinsulinaemic—euglycaemic clamp studies to test insulin resistance, 
both groups found that DGAT2 transgenic mice fed a normal diet failed 
to normally suppress hepatic glucose production. However, the teams 
reported differing results for wild-type mice fed a normal diet. Jornay- 
vaz et al. showed that these control mice had normal insulin suppres- 
sion of hepatic glucose production, whereas Monetti et al. reported that 
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Figure 3 | Mechanism by which selective skeletal muscle insulin resistance 
contributes to hepatic insulin resistance. In insulin-sensitive subjects, 
insulin stimulates glycogen synthesis in both liver and muscle; however, in 
those with skeletal muscle insulin resistance, insulin fails to promote glycogen 
synthesis, diverting substrate to de novo lipogenesis. Increased lipid synthesis 
in patients with muscle insulin resistance thus produces non-alcoholic fatty 
liver disease (NAFLD), with increased triglyceride and reduced high-density 
lipoprotein (HDL) export from the liver. However, these defects in muscle 
insulin signalling can be reversed by a single 45 minute bout of exercise. 


control mice did not suppress hepatic glucose production. Thus, the key 
to interpreting the phenotype attributed to increased hepatic DGAT2 
expression and diacylglycerol accumulation in this model is whether or 
not the control animals fed a normal diet had complete suppression of 
hepatic glucose production in response to insulin. Ina different study in 
humans, SNPs in DGAT2 predicted a smaller decrease in liver fat content 
after very modest (3 kg) weight loss compared with individuals without 
SNPs in DGAT2. However, insulin signalling was not measured in these 
studies, and the participants’ very modest weight loss may prevent us from 
uncovering meaningful information about the role of DGAT2 (ref. 77). 
Further studies are needed to understand the role of DGAT2 in hepatic 
insulin sensitivity and its potential link to whole-body adiposity. 


Lipid export 

Export of triglyceride as VLDL is the only means of reducing hepatic 
lipid content other than through fat oxidation. NAFLD-associated insulin 
resistance is seen in genetic models of impaired VLDL export”, whereas, 
in models of increased VLDL export, NAFLD is ameliorated, independent 
of body weight”*°. As previously discussed, impaired hepatic lipid export 
also contributes to the development of NAFLD and hepatic insulin resist- 
ance in mice overexpressing ApoC3 (ref. 48). 


Regulation of hepatic lipid oxidation 

Rodent models with altered fat oxidation can be used to both test rela- 
tionships between diacylglycerol, PKCe activation and insulin resist- 
ance, and to validate potential therapeutic targets. Mice lacking Ppar-a, 
a key regulator of hepatic lipid oxidation, are prone to NAFLD and fail 
to benefit from the insulin sensitizing effects of omega-3 fatty acids®’. 
Mice that are genetically deficient in the dehydrogenase LCAD have 
diminished mitochondrial fatty-acid oxidation and increased de novo 
diacylglycerol synthesis, increased liver PKCe activity and hepatic insulin 
resistance”. Interestingly, humans and mice with loss-of-function muta- 
tions in LCAD exhibit fasting hypoglycaemia, which has been ascribed to 
defects in amino-acid metabolism and may not be related to alterations 
in hepatic insulin sensitivity’. Hypoglycaemia has also been dissociated 
from insulin resistance in mice fed a ketogenic diet. These mice develop 
profound hepatic insulin resistance associated with increased diacyl- 
glycerol concentrations and PKCe activation, despite hypoglycaemia 
stemming from reductions in fasting hepatic gluconeogenesis™. 
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Changes in whole-body energy metabolism also affect hepatic lipid bal- 
ance. Thyroid hormone receptor-a knockout mice have increased energy 
expenditure (as well as decreased expression of lipogenic enzymes) and 
are protected from diet-induced NAFLD, PKCe activation and hepatic 
insulin resistance®. Similarly, infusion of fibroblast growth factor 21 
(Fgf21) in mice resulted in increased energy expenditure in the liver 
and white adipose tissue, lowered liver diacylglycerol concentrations, 
decreased hepatic PKCe translocation and protection from lipid-induced 
hepatic insulin resistance®’. By contrast, mice deficient for fatty acid amide 
hydrolase (EAAH) have reduced whole-body energy expenditure due to 
diminished hypothalamic-pituitary axis activity and impaired thyroid 
function. The absence of FAAH resulted in increased liver diacylglycerol, 
increased PKCe translocation and hepatic insulin resistance”. These 
data again indicate that the link between NAFLD and insulin resistance 
is an increase in diacylglycerol concentration leading to activation of 
PKCe activity. 

As already mentioned, inhibition of ACC1 and ACC2 leads to both 
a decrease in lipid synthesis and an increase in lipid oxidation. The lat- 
ter effect is due to the disinhibition of carnitine palmitoyltransferase-1 
(CPT1) and more long-chain fatty acyl CoAs entering the mitochon- 
dria. Mutation of key serine residues in Acc] and Acc2 in mice prevents 
inactivation of these enzymes by AMP-activated protein kinase (AMPK), 
leading to increased lipid synthesis and decreased lipid oxidation®. These 
mice have increased hepatic diacylglycerol content, activation of PKCe 
and develop hepatic insulin resistance. By contrast, AMPK is activated 
in mice with a deletion of the mitochondrial gene encoding the sodium- 
dicarboxylate cotransporter Slc13a5, the mammalian homologue of the 
INDY protein in Drosophila. The increased AMPK activity leads to reduc- 
tions in Accl and Acc2 activity, resulting in decreased liver diacylglycerol 
content and reduced PKCé translocation. Together, these changes pro- 
tected Slc13a5 knockout mice from both diet- and age-associated hepatic 
insulin resistance”. 

Many groups have studied whether people with NAFLD have altera- 
tions in hepatic oxidative flux either as a cause or consequence of ectopic 
lipid accumulation. Results in humans have been mixed. When an indi- 
rect tracer method was used to assess hepatic tricarboxylic acid (TCA) 
cycle flux and anaplerosis, both were found to be markedly increased in 
individuals with NAFLD”™””. But when a *'P magnetic resonance spec- 
troscopy technique was used to assess hepatic ATP production, a reduc- 
tion in hepatic energy metabolism was found in individuals with type 2 
diabetes”””’. Because the role of hepatic oxidative flux is key to both the 
understanding of pathogenesis and potential treatment of NAFLD and 
type 2 diabetes, the development of methods to directly measure rates 
hepatic oxidative metabolism is of crucial importance”. 


Dissociation of NAFLD and hepatic insulin resistance 

The requirement of NAFLD for hepatic insulin resistance has been ques- 
tioned by several groups and has been recently reviewed”. It is well 
established that it is possible to experimentally induce insulin resistance 
without NAFLD, or induce NAFLD without insulin resistance under 
certain conditions. For instance, blocking hepatic VLDL secretion with 
acholine-deficient diet or by genetic modification of the export machin- 
ery increases hepatic triglyceride concentrations but does not cause 
insulin resistance*”’. Similarly, mice with liver-specific knockout of 
the gene encoding the protein phosphatase Shp1 develop NAFLD, but 
are protected from insulin resistance’”’. Hepatic carbohydrate respon- 
sive element-binding protein and sterol regulatory element-binding 
protein-1 have recently been independently implicated in dissociating 
NAELD and insulin resistance in mice and humans'*!""™. In addition, 
mice with the gene encoding the lipase activator CGI-58 knocked down 
have profound hepatic steatosis due to suppression of hepatic triglyc- 
eride lipolysis. However, these mice are protected against high-fat-diet- 
induced hepatic insulin resistance’. To explain this paradox, the authors 
of one study examined the subcellular localization of diacylglycerols in 
liver cells and found that knockdown of the gene encoding CGI-58 pro- 
moted diacylglycerol accumulation in lipid droplets, while protecting 
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against diacylglycerol accumulation in the cell membrane and preventing 
PKCe translocation to the cell membrane™. These results are consistent 
with earlier studies in humans, also showing that compartmentation of 
diacylglycerols in the cytosolic” and membrane'® compartments could 
be an important factor in the pathogenesis of liver and muscle insulin 
resistance. The different results between diacylglycerol accumulation in 
the membrane and the cytosolic compartments might reflect differences 
in the measurement and fractionating or lipid-extraction techniques used 
in the different studies and/or in the length of fasting. Nevertheless, the 
CGI-58 ASO data clearly indicate that lipids sequestered in lipid droplets 
do not promote PKC¢e activation and hepatic insulin resistance, and that 
this compartmentation of diacylglycerols and triacylglycerols in lipid 
droplets may explain other models of NAFLD that are not associated 
with hepatic insulin resistance, as discussed later. Future work will better 
discern the importance of specific lipid compartments in the pathogenesis 
of insulin resistance. 

Similarly, liver-specific Hdac3 knockout mice are prone to develop- 
ing hepatic steatosis when fed both normal and high-fat diets and have 
been shown to be protected from lipid-induced hepatic resistance’. 
This apparent disconnect is probably also due to altered partitioning of 
diacylglycerols to the lipid droplet. Loss of Hdac3 in mice was associ- 
ated with an increase in the lipid droplet protein Plin2 and decreased 
activation of PKCe. When Plin2 expression was normalized with a 
specific ASO, the improvements in glucose tolerance were no longer 
evident. Taken together these data strongly suggest that diacylglyc- 
erols need access to a particular subcellular compartment to inhibit 
IRTK activity, whereas diacylglycerol accumulation in the lipid drop- 
let is protective and does not lead to inhibition of IRTK activity and 
insulin resistance. 

Genetic variants in the triglyceride hydrolase PNPLA3 (also known 
as adiponutrin) have been proposed to disassociate NAFLD from 
insulin resistance in normal weight and obese humans“. However, this 
conclusion was based on fasting plasma glucose and insulin measure- 
ments, and hepatic insulin responsiveness was not directly assessed 
in this study. Furthermore, because many of the subjects in this study 
were obese and relatively insulin resistant it is difficult to determine if 
PNPLA3 could exacerbate hepatic insulin resistance in these individu- 
als. Additional insights into the role of PNPLA3 have been gained using 
an ASO to decrease expression of PNPLA3 in rats’”’. Reduced expres- 
sion of PNPLA3 decreased hepatic lipid content owing to decreased 
lipid esterification, pointing to the potential duality of PNPLA3’s func- 
tions in both triglyceride synthesis and hydrolysis’™. Similarly, in obese 
Taiwanese children, variants (which had previously been identified!” 
to strongly correlate with NAFLD in adults of European ancestry who 
were not all obese) did not correlate with NAFLD, but again this may be 
due to the obfuscating effects of obesity in this cohort; however, there 
was a significant association between glucokinase regulatory protein 
(GCKR) and NAELD"”. 


Reversal of NAFLD ameliorates hepatic insulin resistance 
The most effective intervention to reverse NAFLD and hepatic insulin 
resistance in humans is weight loss. Hepatic steatosis quickly resolves 
in both obese patients with type 2 diabetes and non-diabetic normal 
weight individuals with NAFLD after a hypocaloric diet and a modest 
weight loss of less than 10% of total body weight". This is accompanied 
by resolution of hepatic steatosis and normalization of fasting plasma 
glucose concentrations, hepatic glucose production and hepatic insulin 
sensitivity'’’"””, However, recidivism following weight loss is extremely 
common: less than 50% of those who have lost more than 10% of their 
body weight are able to maintain the weight loss after 1 year'”’, and after 
5 years, less than 25% have maintained their weight’. Nevertheless, 
improvements in insulin sensitivity and NAFLD can persist for 2 years 
after weight loss in overweight and obese individuals even after the 
weight is regained, arguing that there is a dissociation between obesity 
and insulin sensitivity or NAFLD". 

In regards to potential medical therapies to treat NAFLD, 
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thiazolidinediones, which are potent PPAR-y activators, have been shown 
to lead to significant reductions in liver-fat content and improvements in 
hepatic insulin sensitivity’® °. This effect probably occurs by thiazo- 
lidinedione activation of PPAR-y in subcutaneous fat tissue leading to 
increased insulin sensitivity and suppression of lipolysis, resulting in a 
redistribution of liver fat to the subcutaneous fat cells’'*. 

Increasing hepatic mitochondrial fat oxidation by promoting subtle 
increases in mitochondrial uncoupling could be another therapeutic tar- 
get for NAFLD, hepatic insulin resistance and type 2 diabetes. Promot- 
ing mitochondrial uncoupling by treating rats fed a high-fat diet with 
low doses of the mitochondrial protonophore 2,4-dinitrophenol (DNP) 
was shown to reduce hepatic triglyceride content, increase insulin-stim- 
ulated IRS2 tyrosine phosphorylation and PI(3)K activity, and improve 
insulin-mediated suppression of hepatic glucose production”’. Consist- 
ent with these findings, simply reversing hepatic steatosis using a liver- 
targeted DNP to increase hepatic mitochondrial fat oxidation by about 
60% was found to reverse hypertriglyceridaemia, hepatic and periph- 
eral insulin resistance, and hyperglycaemia in rat models of NAFLD 
and type 2 diabetes. Furthermore, correction of liver and muscle insulin 
resistance was associated with marked reductions in diacylglycerol-PKCe 
and diacylglycerol-PKC6 activity in liver and muscle, respectively; and 
occurred independently of any changes in body weight, inflammatory 
mediators, FGF21, adiponectin or liver ceramide content’’. Moreover, 
liver-targeting DNP resulted in a 50-fold increase in the ratio of toxic to 
effective doses compared with DNP. Taken together these data support 
the key role of diacylglycerol activation of novel PKC in mediating liver 
insulin resistance and demonstrate the potential feasibility of dissociat- 
ing the toxic effects of DNP from its beneficial effects to promote subtle 
increases in hepatic mitochondrial uncoupling and hepatic fat oxidation 
to treat the related epidemics of NAFLD and type 2 diabetes. 


Outlook 

Hepatic insulin resistance is a complex phenomenon. Although many 
questions regarding the nature of the insulin-signalling defect remain 
unanswered, hepatic insulin resistance is almost universally associated 
with increases in hepatic triacylglycerol and diacylglycerol concentra- 
tions, with the latter leading to PKCe activation and subsequent inhibi- 
tion of IRTK activity. The diacylglycerol-PKCe hypothesis of hepatic 
insulin resistance has recently been validated in humans with NAFLD. 
Exceptions to this rule can be attributed to compartmentation of dia- 
cylglycerols to the lipid droplet, in which it does not lead to inhibition 
of IRTK activity. NAFLD occurs when lipid supply to the liver exceeds 
rates of lipid oxidation and lipid export. Thus therapies targeted to 
reduce fatty-acid delivery to the liver, suppress diacylglycerol produc- 
tion, or raise mitochondrial fat oxidation by promoting subtle increases 
in hepatic mitochondrial uncoupling are of great interest to ameliorate 
NAFLD and type 2 diabetes. = 
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Pro-resolving lipid mediators are 
leads for resolution physiology 


Charles N. Serhan! 


Advances in our understanding of the mechanisms that bring about the resolution of acute inflammation have uncovered a 
new genus of pro-resolving lipid mediators that include the lipoxin, resolvin, protectin and maresin families, collectively 
called specialized pro-resolving mediators. Synthetic versions of these mediators have potent bioactions when adminis- 
tered in vivo. In animal experiments, the mediators evoke anti-inflammatory and novel pro-resolving mechanisms, and 
enhance microbial clearance. Although they have been identified in inflammation resolution, specialized pro-resolving 
mediators are conserved structures that also function in host defence, pain, organ protection and tissue remodelling. This 
Review covers the mechanisms of specialized pro-resolving mediators and omega-3 essential fatty acid pathways that 


could help us to understand their physiological functions. 


ing component in many chronic diseases, including vascular 

diseases, metabolic syndrome and neurological diseases, and 
thus is a public health concern. Understanding endogenous control 
points within the inflammatory response could provide us with new 
perspectives on disease pathogenesis and treatment approaches. Bar- 
rier break, trauma and microbial invasion create a need for the host 
to neutralize invaders, clear the site, and remodel and regenerate 
tissue. The acute inflammatory response is protective, providing 
a terrain in which lipid mediators — such as eicosanoids (prosta- 
glandins and leukotrienes)” produced from the essential fatty acid 
arachidonic acid, as well as many cytokines and chemokines’ > — 
have crucial roles in the initial response. Interactions among pros- 
taglandins, leukotrienes and pro-inflammatory cytokines amplify 
inflammation, the signs of which can be reduced by pharmacological 
inhibition and receptor antagonists’ °. However, given that exces- 
sive inflammation contributes to many widely occurring diseases, 
improvements are needed in treatment and in our understanding of 
the mechanisms involved. 

Pathologists divide the acute inflammatory response into ini- 
tiation and resolution (Fig. 1). Resolution was considered to be a 
passive process®. With the identification of mediators with pro- 
resolving capacity that could be biosynthesized from omega-3 (or 
n-3) essential fatty acids (EFA), evidence emerged that resolution 
of self-limited acute inflammation might be an active, programmed 
response that is ‘turned on’ in animal models, and not simply a pro- 
cess of passive dilution of chemoattractants’*. For a metabolite to 
fulfil the role of mediator, it must be produced in sufficient amounts 
in vivo to evoke bioactions. The omega-3 fatty acids EPA and DHA, 
which are found in marine oils, have long been thought to have 
anti-inflammatory properties, whereby they compete with arachi- 
donic acid, reducing pro-inflammatory eicosanoids’. The molecu- 
lar mechanism by which this occurs is unclear, and the evidence is 
inconclusive as to whether omega-3 EPA and DHA are beneficial for 
human health and as treatments for disease. Resolving inflammatory 
exudates use omega-3 fatty acids (Fig. 1) to produce structurally 
distinct families of signalling molecules — resolvins, protectins and 
maresins, collectively termed specialized pro-resolving mediators 


| ; xcessive inflammation is widely appreciated to be a unify- 


(SPMs). This realization prompted new interest in resolution path- 
ways and the innate immune mechanisms for homeostasis. SPMs 
are agonists with the potential to stimulate key cellular resolution 
events, namely limiting polymorphonuclear neutrophil infiltration 
and enhancing macrophage clearance of apoptotic cells”®, as shown 
in pre-clinical studies using animal models”. This Review addresses 
the role of novel lipid-derived SPMs in resolution that also func- 
tion in host defence, pain and tissue regeneration, which could help 
us to understand the role of SPM pathways in human resolution 


physiology. 


Cellular events in resolution of acute inflammation 

During the initiation phase, leukocytes traffic from circulation to the 
site of trauma or microbial invasion, forming inflammatory exudates 
— conventionally these are considered to be the ‘battlefields’ where 
most resolution occurs®. The first responders, neutrophils, swarm 
like bees to defend the body, moving along chemotaxic gradients 
(for example, increasing levels of the leukotriene LTB,) Mand exiting 
venules governed by prostaglandins (such as PGE, and PGI,) that 
act on vascular cells and blood flow’ (Fig. 1). These lipid mediators, 
along with many cytokines, chemokines and complement compo- 
nents (C5a and C3b), stimulate the chemotaxis of neutrophils into 
tissues to phagocytize and neutralize invaders® °. Many therapeutic 
agents block or antagonize the initiation steps of acute inflammation 
(for example, prostaglandin biosynthesis inhibitors or chemokine 
receptor antagonists)’*””. At acellular level, the main events of 
resolution are the cessation of neutrophil influx and efferocytosis 
(macrophage clearance of debris, including apoptotic neutrophils)*”. 
The protective acute inflammatory response evolved to permit the 
repair of injured tissues and eliminate invading organisms. Ide- 
ally, it is a self-limited process, leading to complete resolution that 
enables a return to homeostasis (Fig. 1). By studying self-limited 
inflammation in animal models, and using a systems approach to 
investigate resolving exudates, novel bioactive products derived 
from essential fatty acids were uncovered. The bioactions of these 
products include limited neutrophil influx in vivo, reduced human 
neutrophil transmigration and counter-regulation of cytokines such 
as tumour necrosis factor-a (TNF-a) in mice. Each bioactive product 
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Figure 1 | Lipid mediators in the acute inflammatory response, resolution 
and other outcomes. Lipid mediators have pivotal roles in the vascular 
response and leukocyte trafficking, from initiation to resolution. Lipoxins, 
resolvins, protectins and maresins, collectively called specialized pro-resolving 
mediators (SPMs), are produced during this self-limited response (Fig. 2). After 
barrier break, tissue injury or trauma, eicosanoids are crucial in initiating the 
cardinal signs of inflammation (redness, heat, pain and swelling). As part of the 
vascular response, leukocytes traffic to the site of injury. The prostaglandins 
PGE, and PGI, (involved in vasodilation) and the leukotriene LTB, (involved 
in chemotaxis and adhesion) stimulate the migration of neutrophils to the 
tissue. As part of the neutrophil-monocyte sequence, lipid mediator (LM) class 
switching from eicosanoids to lipoxins signals ‘stop’ and begins the end of the 
acute inflammatory response. Lipoxins and resolvins stimulate the recruitment 
of non-phlogistic monocytes. The resolving macrophages then clear apoptotic 


was systematically evaluated, and found to be biosynthesized from 
EPA and DHA by separate pathways in human neutrophils and mac- 
rophages in vitro’*”’. 


Elucidation of pro-resolving lipid-derived mediators 

The anti-inflammatory process is not equivalent to pro-resolution, 
which involves SPMs acting as agonists to stop further neutrophil 
influx and the activation of nonphlogistic responses by macrophages 
and resolution programs (Fig. 1). The key to understanding this 
difference was the identification of novel families of autacoids and 
forms of these molecules that could potentially be triggered by aspi- 
rin, providing evidence that, in animal models, resolution is actively 
orchestrated by lipid mediators (Fig. 2 and Box 1). Harnessing the 
pathways that stimulate resolution will be a challenge for the future”. 

Omega-3 fatty acids are a widely taken dietary supplement, but 
less than 25% of these supplements are prescribed by health-care 
providers'*. Given the mixed results'°on the benefit of these supple- 
ments in clinical trials, it is crucial for public health that the mecha- 
nisms that underlie their requirement are established. 

Using a systems approach with resolving mouse exudates was key 
to elucidating SPM actions and pathways**. Biosynthesis with human 
leukocytes and complete stereochemistry of each major resolvin 
(RVE1, RvD1, RvD2, RvD3 and RvD5), protectin and maresin was 
accomplished by matching these mediators with those prepared by 
total organic synthesis, which also confirmed their potent bioactiv- 
ity'’’”. Resolvins and protectins control the magnitude and duration 
of inflammation in animal disease models’*, and can, for example, 
increase animal survival’? (Fig. 2). The potent in vivo actions of 
RvD1 and RvD2 are reported in many pathologies, such as obesity 
and pathologies affecting the vascular’, airway”, dermal, renal and 
ocular systems, and in processes including pain, fibrosis and wound 
healing’*’’. Their role in governing neutrophil influx, resolution 


neutrophils in a process called efferocytosis (stimulated by resolvins and 
protectins). After this has taken place normal structure and homeostasis can 
be restored. Signs of resolution include sequestration of pro-inflammatory 
cytokines, clearance of neutrophils from epithelial surfaces, phagocytosis of 
apoptotic neutrophils and removal of inflammatory debris and microbial 
invaders. Failed resolution can lead to increased levels of prostaglandins and 
leukotrienes, chronic inflammation (which can be inhibited by resolvins) 

and fibrosis. SPMs counter-regulate pro-inflammatory chemical mediators, 
reducing the magnitude and duration of inflammation, and stimulate 
re-epithelialization, wound healing and tissue regeneration in model organisms. 
In addition to the release of omega-3 substrates from phospholipid stores”, 
these substrates can enter exudate as a result of oedema from peripheral blood, 
as shown in mice”. SPMs enhance efferocytosis, stimulate signs of resolution 
and signal to adaptive immunity. 


macrophages and reducing pro-inflammatory mediators (PAF, LTB, 
and prostaglandins) seem to be fundamental in all organs. 


Resolvin D3 and the maresin pathway 

Within self-limited exudates, RvD3 has a unique time frame com- 
pared with RvD1 and RvD2. In mouse peritonitis, RvVD3 appears 
late in resolution, suggesting it has a specific role. The complete 
stereochemistry of RvD3 was recently established”, confirming its 
potent anti-inflammatory and pro-resolving actions*. Macrophage 
biosynthesis of maresin MaR1 and its potent pro-resolving and tissue 
regenerative actions” (Fig. 2) involve an active 13S,14S-epoxide- 
maresin intermediate, stimulating a macrophage phenotype switch 
from M1 to M2 (ref. 24) (Fig. 2). The switch towards the M2 phe- 
notype is associated with reparative and anti-inflammatory mac- 
rophage functions”. 


Resolution agonists and resolution disrupters 

Several lessons have emerged from recent studies. Prostaglandins are 
central to vascular responses, permitting neutrophils and monocytes 
to leave post-capillary venules (a process called diapedesis). Their 
production through COX-1 and COX-2 (also known as prostaglan- 
din G/H synthase 1 and 2) is crucial for initiation and timely resolu- 
tion”*”’ (Fig. 1). PGE, and PGD, each evoke pro-inflammatory and 
anti-inflammatory responses that depend on tissue location’. PGE, 
enhances LTB,-mediated neutrophil extravasation and tissue injury 
that is blocked, for example, by topical administration of synthetic 
lipoxin A, (LXA,) and its aspirin-triggered epimer 15-epi-LXA, 
(ref. 28), illustrating both a pro-inflammatory PGE, function in 
mouse skin and the ability of 15-epi-LXA, mimetics to limit neu- 
trophil infiltration and tissue injury. Liquid chromatography with 
tandem mass spectrometry (LC-MS-MS)-based profiling demon- 
strated the temporal switch from an environment with high levels 
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of prostaglandins and LTB, to one with high levels of lipoxins, a 
process known as lipid-mediator class switching, in mouse exudates 
(Fig. 1 and Box 1). PGE, or PGD, added to isolated human neutro- 
phils increases 15-lipoxygenase type I translation from messenger 
RNA stores in a cyclic- AMP-dependent manner, increasing lipoxin 
biosynthesis (identified using MS-MS spectra)”*. 

Inhibition of COX-2 delays resolution because prostaglandins 
have crucial roles in resolution and because they are initiators of 
lipid-mediator class switching (Fig. 1), as shown in animal disease 
models in vivo'***”?. When mapping resolution, it became apparent 
that initiation signals the end of inflammation’ and that leukocyte 
traffic in pus permits prostanoids to signal the biosynthesis of other 
resolution mediators (Fig. 1 and Fig. 2). For example, disruption of 
physiological lipid-mediator class switching has deleterious conse- 
quences in mouse models of arthritis”. 

To pinpoint the crucial steps and mechanisms of SPM action 
within inflammation resolution, it was important to introduce 
quantitative indices'**’ that enabled the assessment of resolution 
in vivo~'*’*", Resolution indices identified agents that stimulate as 
well as those that disrupt or delay resolution (resolution interval); 
for example, COX-2 and lipoxygenase inhibitors'*’’’. Specific SPMs 
shorten the resolution interval by limiting neutrophil recruitment 
and stimulating both macrophage efferocytosis (Fig. 1) and bacte- 
rial killing*’****, demonstrating the neutrophil-monocyte sequence 
and the macrophage responses needed for tissue regeneration”. 
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Figure 2 | Production of specialized pro-resolving mediators in resolving 
inflammatory exudates. a, A typical self-limited acute inflammatory 
response time course in experimental settings from initiation (time 0) 

to resolution: oedema; neutrophilic infiltration, including the temporal 
biosynthesis of specialized pro-resolving mediators (SPMs) that occurs 

in resolving exudates; and non-phlogistic recruitment of monocytes 

or macrophages, which is required for homeostasis, and the repair and 
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Glucocorticoids, specific cyclin-dependent kinase inhibitors, statins, 
annexin peptides and aspirin all enable resolution®’”*”*. In the same 
way there are many mediators in the initiation of inflammation, 
there are also many endogenous mediators and drugs that have an 
impact on resolution®’*”. 

Although both aspirin and non-steroidal anti-inflammatory drugs 
(NSAIDs) inhibit prostanoid biosynthesis, aspirin is an irreversible 
inhibitor that acetylates COX-1 and COX-2, whereas other NSAIDs 
are reversible inhibitors’. Aspirin acetylation of COX-2 modifies 
the catalytic domain, blocking prostaglandin biosynthesis’”, but 
it remains active, producing 15R-HETE from arachidonic acid, 
18R-HEPE from EPA and 17R-HDHA from DHA in cells carrying 
COX-2. These products can be transformed by human neutrophils 
in vitro to aspirin-triggered lipoxins, aspirin-triggered resolvins’* 
and aspirin-triggered protectins”. Each has a potent effect, stopping 
human neutrophil migration and enhancing macrophage clean up, 
improving resolution in mice. Whether aspirin or statins enhance 
the production of aspirin-triggered SP Ms in humans remains to be 
established using mass spectral-based identification. And whether 
aspirin favours resolution in humans, in whom distinct resolution- 
phenotypes have emerged, is of considerable interest””**. In mice, 
intravascular LXA, is produced by platelet-neutrophil aggregates 
during ischaemia, which reduces vascular inflammation. Aspirin 
triggers 15-epi-LXA,, which was identified using LC-MS-MS, and 
this was less effective in mice deficient for the lipoxin receptor ALX, 
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regeneration of injured tissues (involving specific microRNAs regulated by 
SPM receptors). b, Resolution omega-3 metabolome. Biosynthesis of resolvins, 
protectins and maresins from EPA and DHA with the main bioactive structures 
from each family (Box 1; see refs 17 and 20 for details of the biosynthetic 
mechanisms and stereochemical assignments of the bioactive products). Each 
SPM stimulates macrophage switching to the M2 phenotype and is produced 
by human neutrophils, apoptotic neutrophils and macrophages’. 
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Bioactive products of omega-3 were originally isolated from mouse 
resolving exudates. The structure of these products was elucidated, 
and the biosynthesis of each new omega-3 family from EPA and 

DHA was recapitulated with isolated human neutrophils®° and 
macrophages in vitro**, Biosynthesis of E-series resolvins is initiated 
by the molecular oxygen insertion at the carbon-18 position of 

EPA produced by hypoxic human endothelial cells and acetylated 
COX-2, or p450. Microbial p450 (ref. 7) (Fig. 2, left) also produces 
18-HEPE, which is converted to bioactive E-series members by 
human neutrophils. Both aspirin-dependent and independent 
formation occurs in human peripheral blood*”*. The resolution 
metabolome also activates 17-lipoxygenation of DHA by hydrogen 
abstraction; 17S-HpDHA is converted to resolvin-epoxide-containing 
intermediates (Fig. 2, centre) that are transformed to resolvins D1-D4 
by human neutrophils, each identified using methods such as mass 
spectrometry of bioactive products. The 17S-HpDHA intermediate 

is also a precursor to 16,17-epoxide-protectin, which is converted 


Resolution omega-3 metabolome 


to protectin D1/neuroprotectin D1 and related structures by 

human leukocytes (neutrophils and T cells), neural cells and retinal- 
pigmented epithelial cells*°”°"!*, Maresins isolated from human 
and mouse macrophages are produced by initial lipoxygenation 

with molecular oxygen insertion at the carbon-14 position to form 
the hydroperoxide intermediate, which is rapidly converted to 
13S,14S-epoxide-maresin and enzymatically converted to maresin-1 
(ref. 24) (Fig. 2, right). Lipoxygenase mechanisms involve hydrogen 
abstraction and molecular oxygen insertion at specific carbon 
positions that are predominantly in the S configuration. Aspirin, by 
COX-2 acetylation and p450 enzymes, contributes to the biosynthesis 
of R-configuration alcohols in lipoxins, resolvins and protectins’. The 
stereochemistry of each bioactive specialized pro-resolving mediator 
family member (Fig. 2) has been determined, and the biosynthesis 
and potent pro-resolving and anti-inflammatory actions in murine 
exudates and human tissues has been confirmed (see ref. 17 for 
original reports and refs 10, 20 for reviews). 


providing in vivo evidence that aspirin can jump-start resolution 
circuits in mice”. 


New mechanisms in local SPM biosynthesis 

Microparticles are membrane-derived vesicles produced by a range of 
cell types that contribute to human pathologies. Microparticles from 
self-resolving exudates show anti-inflammatory and pro-resolving 
capacity”’ in mice. Resolution microparticles enhance efferocyto- 
sis” and carry pro-resolving signals, including hydroxy-SPM- 
intermediates esterified in phospholipids*’. Secreted PLA, (sPLA,) 
prompts the release of pro-resolving signals from microparticles for 
trans-cellular conversion by human macrophages in vitro'**’. Because 
nanomedicines are useful for local targeting and delivery, resolution 
microparticles, and their ability to shorten the resolution interval 
in mouse peritonitis, were used as a basis for biomimicry to con- 
struct humanized nanoparticles containing an LXA, analogue, or 
aspirin-triggered-RvD1 (ref. 40). These nano-pro-resolving medi- 
cines (NPRMs), carrying SPMs or SPM analogues, enhance wound 
healing of human keratinocytes and are protective in a mouse model 
of temporomandibular joint disease characterized by inflammation- 
induced bone loss” (Fig. 1 and Box 2). 

Microparticles can also transfer substrate and intermediates to 
macrophages during efferocytosis to enhance SPM biosynthesis, 
which was demonstrated by transfer of deuterium labels from pre- 
cursors to SPMs (identified using LC-MS-MS)"’. Myeloid cells at 
different stages have agonist- and phenotype-specific lipid mediator 
profiles. For example, human neutrophils from healthy peripheral 
blood predominantly produce LTB,, whereas apoptotic neutrophils 
produce PGE,, LXB, and RvE2 signals”. 

Both M1 and M2 macrophages have specific markers and pathways 
that are specialized to the functions of that subpopulation in inflam- 
mation and resolution”. Human M2 macrophages have more of the 
enzymes” needed for cell-type-specific lipid mediators. M2 cells pro- 
duce SPM profiles with lower levels of LTB, and prostaglandins than 
MI cells. Both cell types engulf apoptotic neutrophils, changing their 
lipid mediator signature profiles. In M2 cells, LTB, is downregulated and 
SPMs are increased”, suggesting that M1 and M2 subpopulations”*! 
produce functional lipid mediator signatures that can affect both physi- 
ological and pathophysiological states’’. In addition, group IID sPLA, 
was identified as a resolving sPLA, that was expressed in dendritic cells 
and macrophages that release substrates with the capacity for producing 
RvD1 and PGJ,, as identified by mass spectrometry in mouse lymphoid 


tissue in vivo”. The functional contributions of this resolving sPLA, to 
the inflammatory response in humans is unknown. 

The role of eosinophils is well appreciated in parasitic infections 
and allergic responses. In patients with severe asthma, protectin D1 
(PD1) is present in exhaled breath condensates”, and is lower in 
eosinophils from these patients“. Human eosinophils produce PD1, 
which reduces adhesion molecules (CD11b and L-selectin), eotaxin-1 
and chemotaxis at nanomolar concentrations, without affecting 
degranulation, superoxide generation or cell survival. Eosinophils 
also stimulate resolution in mouse peritonitis through SPMs ini- 
tiated by mouse eosinophils**. LC-MS-MS-lipidomics identified 
LXA,, RvD5, 17-HDHA and PD1 from eosinophils, and RvE3 in 
vivo (Fig. 2), which limit neutrophil infiltration and regulate mac- 
rophages****. Hence, by their ability to produce SPMs, eosinophils 
might contribute to resolution. In support of this, eosinophil deple- 
tion has been found to lead to failed resolution, which is rescued by 
PD1 or eosinophil restoration in mice. Thus, cellular traffic to sites 
of inflammation has a dynamic impact on lipid mediator signatures 
and specific SPM metabolomes activated within the local milieu. 


SPM cellular actions in disease models 
SPMs increase survival in diverse mouse models; for example, inflamma- 
tion and tissue injury in the airway, skin and eye that result from collateral 
damage are improved with exogenous SPMs"°. The SPM nanomolar doses 
required to stop ongoing inflammation and promote resolution rely on 
G-protein-coupled receptors (GPCRs). Several SPM receptors have been 
identified using GPCR screening, labelled ligands for specific binding 
(stereospecific nanomolar dissociation constant) and functional cellular 
responses” “”. In general, SPMs do not utilize intracellular calcium (Ca”*) 
mobilization in neutrophils for signal transduction but instead activate 
phosphorylation. RvE1 specifically binds to the receptors ChemR23 
(ref. 47) and BLT1 to evoke pro-resolving responses. RvE] activation of 
ChemR23 enhances macrophage phagocytosis by phosphoprotein-medi- 
ated signalling’. RvE1 blocks LTB, binding and promotes, through BLT1, 
apoptosis of neutrophils for their clearance by macrophages”, whereas 
LTB,-BLT1 signals neutrophil survival. The blocking of survival signals 
to neutrophils by RvE1 marks an important difference from the process 
that occurs in the innate response, in which neutrophils must undergo 
timely apoptosis for clearance*” (Fig. 2). 

RvD1 binds and activates the human receptor GPR32, and binds to 
both the human and murine forms of the LX A, receptor (called ALX/ 
FPR2). Transgenic mice overexpressing human ALX/FPR2 require 
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BOX2 


A number of steps have led to the possibility of assessing SPM function 
in humans (Box 2 Fig.). The molecules derived from omega-3 and their 
bioactions have been identified in the resolving inflammatory exudates 
of mice. Studies have reported the structure of these molecules, their 
actions with human neutrophils and their proposed biosynthesis 

from EPA and DHA using 18-oxygen, 14-carbon and trapping of 
transient intermediates!”. Confirmation of the bioactive structures and 
total organic synthesis to assign complete stereochemistry to each 
specialized pro-resolving mediator (SPM) family member required 

the development of a strategy to match the biologically active isolated 
product with those prepared by total organic synthesis. NMR (nuclear 
magnetic resonance) confirmed the geometry of the conjugated 
double bond systems present in SPMs; for example, RvE1 (ref. 47), 
RvD1, RvD2 (ref. 34) or MaR1 (refs 23, 24) (ref. 17 describes the 
matching criteria). The resolving functions of SPMs are separate from 
those of anti-inflammatories; to serve as an SPM, the omega-3 product 
must be biosynthesized at the in vivo levels that are commensurate 
with the SPM’s bioactions. For many of the SPMs, this criterion has 
been achieved and confirmed with commercial resolvins (see ref. 10 
for a review). 

SPMs have been identified as conserved structures that are present 
in trout (PD1 and RvD1)*, salmon (RvD1 and RvD2)""’ and planaria 
(RvE1 and MaR1)*. Microfluidic chambers that permit visualization 
of cell-cell interactions between leukocyte subpopulations (human 
neutrophils and monocytes) and distinguish phlogistic compared with 
non-phlogistic phagocyte behaviour are ideal for screening SPMs and 
humanized nano-pro-resolving medicines (NPRMs)*°°°. This single-cell 
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screening permits the optimization of enrichment of microparticles 
with SPMs, the production of NPRMs and the viewing of neutrophil- 
monocyte interactions°©°’, which are essential for appreciating the 
signals that occur during the neutrophil-+monocyte sequence (Fig. 1). 
Using microfluidic chambers, single human neutrophils were 
assayed within about 5 minutes of their capture from whole blood 
(minimizing the isolation time reduces potential artefacts). Nanomolar 
concentrations of each SPM prevents human neutrophil migration to 
interleukin-8 (IL-8), whereas at equimolar doses the precursor (DHA) 
is not active®**’, Resolution indices permit the quantitative assessment 
of SPM actions in animal models'®*°; this is essential for defining the 
SPMs within the integrated response of the host to acute inflammatory 
challenges. Demonstration and identification of omega-3 SPMs in 
human tissues is required so that we can appreciate their potential 
roles in humans. RvE1 and RvE2 (refs 47, 61) were identified in the 
peripheral blood of healthy volunteers, some of whom had been given 
EPA supplements, using LC-MS-MS multi-reaction monitoring. RvD1, 
RvD2 (ref. 111), PD1 and 17-HDHA were identified in the exhaled 
breath condensates of people with asthma’; and additional SPMs 
were identified in human adipose tissue?!®, the brains of people with 
Alzheimer’s disease*®!™, and patients with multiple sclerosis!!? and 
rheumatoid arthritis'° using mass spectral identification. Although, 
at present, the demonstration of omega-3 in human tissues is in the 
initial stages, with the capabilities of LC-MS-MS-based profiling’’, 
SPMs could have potential as markers for nutritional status. The field 
is now set with tools for assessing SPM function in humans and the 
relationships of SPMs with nutrition and disease. 


Identification in 
human tissues 


Single-cell screening 
leading to biosynthesis 
with human cells 


Bioactions in 
animal models 


Endogenous biosynthesis and 
actions in mouse disease models 


less RvD1 than wild-type mice to stop inflammation”, and in receptor- 
deficient mice, RvD1 seems to have no leukocyte-directed actions”. 
Resolution involves specific microRNAs (miRNAs) regulated by SPM 
receptors”, RVD1-GPR32 upregulates miR-208 and anti-inflamma- 
tory interleukin-10 (IL-10), and downregulates miR-219, decreasing LTB, 
levels through the regulation of 5-lipoxygenase™. This miRNA regulation 
is an example of SPM signalling with a sustained tissue impact. 

SPM receptors are also rapid signallers. For example, recombinant 
RvD1-GPR32 blocks histamine H1 receptor-stimulated increases 
in intracellular Ca** in Chinese hamster ovary (CHO) cells by rapidly 
stimulating the phosphorylation of the H1 receptor, stopping Ca** mobi- 
lization™. This form of SPM signalling, which has been reported in con- 
junctival goblet cells and RvD1, is also functional in salivary glands® and 
is probably relevant in human neutrophils, which rapidly stop chemotaxis 
in microfluidic chambers and change shape on exposure to SPMs**”. 
RvD3 and RvD5 can also activate recombinant GPR32 (refs 22, 33). 
Given the temporal production of RvD3 in vivo” (Fig. 2), these findings 
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underscore that locally produced SPMs can affect different cell types and 
receptors with a spatio-temporal dependency. 

In addition to RVD1 and LXA,, ALX/FPR2 is activated by peptide 
pro-resolving mediators, such as annexin A1, as well as high concen- 
trations of pro-inflammatory peptides**. This capacity of ALX/FPR2 
involves ligand-biased receptor activation and heterodimerization of 
ALX/FPR2, with related FPRs dictating pro-inflammatory signalling 
and homodimerization of ALX/FPR2 determining pro-resolving sig- 
nalling**. LXA, also enhances the activity of the ALX/FPR2 promoter, 
which is mutated in human cardiovascular disease™. 


Infection and resolution programs 

Topical RVE1 and LXA, each reduce the severity of periodontal disease in 
rabbits by enhancing clearance of the causative organism Porphyromonas 
gingivalis”. Although the anti-inflammatory actions of SPMs were 
uncovered in sterile inflammation models”, the relationship between 
resolution and infection is of interest because of the known eventual 
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immunosuppressive actions of anti-inflammatory drugs”. Surprisingly, 
RvD2 protects mice from caecal ligation puncture (CLP)-induced sepsis”; 
it has potent actions, enhancing phagocytosis and bacterial killing. In 
self-limited Escherichia coli infections, resolution programs are activated 
in mice, and levels of host PD1, RvD5 and RvD1 are elevated**. When 
SPMs identified in vivo were added back to mouse phagocytes, human 
macrophages or neutrophils as synthetic SPMs, they enhanced bacterial 
phagocytosis and killing, as well as clearance***'. Importantly, when 
the host is given SPMs that are known to act on the host, lower antibiotic 
doses are needed to clear infections”. 

Exogenous LXA, is also protective in rat CLP-induced sepsis, reducing 
bacterial burden and pro-inflammatory mediators through a macrophage 
nuclear-factor-«B (NF-«B)-mediated mechanism, which reduces sys- 
temic inflammation”. Aspirin-triggered-LXA, increases E. coli phago- 
cytosis in a phosphatidylinositol-3-OH kinase (PI(3)K)- and scavenger 
receptor-dependent manner, and ALX/FPR2 is upregulated in patients 
with Crohn’s disease and enhances bacterial clearance”. Mycobacterium 
tuberculosis infections are susceptible to modulation of LTA, hydrolase. 
Besides altering LTB,, this may also engage resolution programs by acti- 
vating LTB,-LXA, production, regulating host responses in zebrafish, 
mice and possibly in humans™®. Given the problem of rising antibiotic 
resistance, activation of resolution programs could provide new antimi- 
crobial approaches to lower our exposure to antibiotics”. 

The herpes simplex virus causes ocular infections that lead to stromal 
keratitis with viral-initiated immunopathology. RvEI and PD1 are both 
potent and topically active in mouse models of the infection, reducing 
pro-inflammatory mediators and stimulating IL-10 (refs 66, 67). Lethal 
dissemination of the H5N1 virus activated genes in mice that are associ- 
ated with lipoxin biosynthesis, whereby sustained inflammation inhibited 
lipoxin-mediated anti-inflammatory host responses, permitting viral dis- 
semination™. H5N1 activates the host resolution-metabolome increasing 
PD1 levels (identified by LC-MS-MS)®. Host protectins show antiviral 
activity, blocking replication of the H5N1 influenza virus. During the time 
course of infection with the low-pathogenicity strain of influenza H3N2, 
anti-inflammatory mediators are produced that correlate with resolution 
and SPM-related pathway markers”. SPMs are also found in yeast infec- 
tions; for example, Candida albicans infections, in which RVE1 enhances 
yeast killing and clearance in mice”. These results support the idea that 
treating the infected host with host-directed pro-resolving molecules 
could provide new opportunities to reduce inflammation and enhance 
clearance by exploiting host-pathogen interactions**'”. 


Chronic inflammatory disease models 
Periodontal disease is a chronic inflammatory disease in which infection 
leads to neutrophil-mediated tissue injury around the tooth. Activated 
neutrophils from patients with periodontitis produce PGE,, LTB, and 
LXAy each identified using LC-MS-MS”. PGE, in this tissue leads to 
bone loss. P. gingivalis elicits neutrophil recruitment in mouse air pouches 
along with COX-2 upregulation. LXA, stable analogues reduced both 
neutrophil influx and COX-2 expression that was associated with the 
oral pathogen”. In addition, P gingivalis increased COX-2 expression in 
the lung and heart of mice, and 16S ribosomal RNA of P. gingivalis was 
present in these tissues, providing evidence of a role for this oral patho- 
gen in the development of systemic inflammation. Transgenic rabbits 
overexpressing human 15-lipoxygenase type I produce 6-10 times the 
amounts of LXA,, which was identified using LC-MS-MS, than wild- 
type-rabbits”; transgenic rabbits have less bone loss in periodontitis and 
markedly reduced neutrophil recruitment and vascular leakage through 
their skin on immune challenge compared with wild-type rabbits, sug- 
gesting that overexpression of lipoxin biosynthesis is protective and could 
be useful in controlling inflammation-mediated bone destruction”. 
Unexpectedly, overexpression of 15-lipoxygenase in these transgenic 
rabbits sharply reduces atherosclerotic lesions”. In 12/15-lipoxygenase 
transgenic mice, LC-MS-MS identified RvVD1, PD1 and 17-HDHA with 
reduced PGE, from activated macrophages”. LXA,, PD1 and RvD1 
each reduced cytokines (for example, MCP-1) from endothelial cells 
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and adhesion molecules (P-selectin and VCAM-1) but not ICAM-1. 
LXA,, PD1 and RvD 1 also enhanced uptake of apoptotic thymocytes, 
which could contribute to the anti-atherogenic role of this pathway in 
mice. This process can also be manipulated by diet to govern severity 
of atherogenic lesions’”*”’. RvE1 is protective in periodontal disease but, 
unlike LXA,, exogenous RVE] also stimulates bone regeneration in rab- 
bit models of periodontitis”. In murine models of arthritis, RVD1 and 
17-HDHA reduce pain and tissue damage, proving more potent than 
either steroid or pain treatments”. 

Unresolved inflammation, epithelial and microvascular injury can 
lead to excessive fibrosis (Fig. 1) that impairs organ function. Leu- 
kotrienes are profibrotic, and in people with scleroderma interstitial 
lung disease, LXA, is present in the lungs at levels that seem unable to 
counter-regulate profibrotic factors”. In animals, an exogenous aspi- 
rin-triggered lipoxin analogue reduces pulmonary fibrosis induced 
by the antibiotic bleomycin®, and both LXA, and benzo-LXA, reduce 
renal fibrosis”. Exogenous RvE1 and RvD1 protect from renal fibrosis, 
reducing collagen I and IV, a-SMA and fibronectin®. In addition, exog- 
enous RvD1 reduces pro-inflammatory mediators that are generated in 
response to cigarette smoke and pulmonary toxicants”. 


Organ regeneration and wound healing 

In mice, exogenous SPMs have been found to stimulate would healing 
ina range of studies. LXA, stimulates re-epithelialization of the cor- 
nea ina gender-specific fashion in mice™; when applied to wounds, 
RvE1, RvD1 and RvD2 each stimulate murine dermal healing, reduc- 
ing neutrophilic infiltration and stimulating re-epithelialization®; 
RvD1 and RvD2 also stimulate wound healing in diabetic mice”. 
Macrophages have a role in wound healing and organ regeneration, 
and the macrophage-derived maresin pathway stimulates tissue 
regeneration. This pathway (Fig. 2) is present in the flatworm Dug- 
esia tigrina, in which RvE1 and MaR1 were each found to reduce 
regeneration times (speed of head-segment regrowth)”. Given the 
importance of tissue regeneration in trauma and infection, regulation 
of resolution programs could hold promise as a therapy. 


SPMs in adaptive immunity 

Lymphoid tissue such as mouse spleen produces RvD1, 17-HDHA, 
PD1 (ref. 87) and LXA, (ref. 88) from endogenous sources, which 
was identified using mass spectrometry, suggesting these products 
are strategically positioned to act on lymphocytes (Fig. 1). Both 
exogenous 17-HDHA and RvD1 increase human B-cell immuno- 
globulin-M and immunoglobulin-y, a response that was not shared 
by PD1. B-cell differentiation is enhanced by exogenous 17-HDHA 
towards the CD27*CD38* antibody-secreting cell phenotype”. PD1 
is biosynthesized by human T-helper-2-skewed mononuclear cells 
by a 16(17)-epoxy-protectin intermediate (Fig. 2) and reduces T-cell 
migration, TNF-a and interferon-y, promoting T-cell apoptosis”. 
Exogenous LXA,, RvE1 and PD1 each upregulate expression of the 
chemokine receptor CCR5 on leukocytes, facilitating their clearance 
and resolution”. Exogenous PD1 reduces CD4* T-cell infiltration 
into cornea’, as does RvE1 (ref. 67), in herpes simplex viral infec- 
tions. Exogenous RvD1 reduces CD11b* leukocytes and CD4* and 
CD8* T lymphocytes in uveitis’'. Exogenous RvE1 and RvD1 each 
regulate T-cell activation in choroid and retina, and are biosynthe- 
sized in this tissue, as identified by LC-MS-MS”. Exogenous RvE1 
induces apoptosis of activated T cells by the induction of 2,3-dioxy- 
genase in dendritic cells, representing a new functional subtype of 
dendritic cells that have a role in resolution”’. Exogenous RvE1 has 
also been shown to reduce numbers of mouse CD4" T cells and CD8* 
T cells in atopic dermatitis”. 


Neuroinflammation and pain 

Mouse and human brains have the capacity to produce resolvins and 
protectins, as do human microglial cells, in which they reduce cytokine 
expression®”*”*. And the production of SPMs by the brain cells of trout 
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indicates that they are conserved from fish to humans”. In ischaemic 
mouse brains, immunoreactive resolvins, protectins and their aspirin- 
triggered forms are produced”; in these mice synthetic SPMs are pro- 
tective, downregulating excess leukocyte infiltration and reducing local 
neuronal injury, COX-2 induction, and levels of IL-1$ and NF-«B. Thus, 
in the brain, DHA could be a precursor to neuroprotective signalling 
pathways evoked by ischaemia reflow tissue injury. The DHA product 
10,17-dihydroxy-protectin is called neuroprotectin D1 (NPD1) when 
biosynthesized and acting in neural tissues and retinal epithelial cells 
owing to its potent actions to reduce neuroinflammation and protect 
neural cells”. 

DHA is enriched in the brain, synapses and the retina, in which it is 
known to have a protective role, but its role as a precursor to mediators 
in resolution and neuroprotection is still emerging. Synthetic NPD1 has 
a potent protective role in the nervous system, reducing stress pathways 
that lead to cell death and increase cell survival, and in several ocular 
models of important diseases (such as herpes™ and neovascularization 
in the eye”) NPD1 targets microglia*”*”’. Human neutrophils biosyn- 
thesize 17R-NPD1/PDI1 that is enhanced by aspirin in vitro, limiting 
neutrophil transmigration and enhancing macrophage efferocytosis””. 
Synthetic aspirin-triggered NPD1 reduces brain oedema in penumbra 
and subcortical lesion size, and improves neurological scores!’ 

In the brains of people with Alzheimer’s disease, NPD1 is reduced”® 
and the resolution pathway (SPM receptors and products) is dimin- 
ished’*'. LXA, and RvD1 are reduced in cerebrospinal fluid and hip- 
pocampus, which correlated with lower scores on the mini-mental state 
examinations in these patients. These findings provide further evidence 
that failed resolution of the inflammatory response could contribute 
to human disease’®’. RVD1 added to macrophages from patients with 
Alzheimer’s in vitro reduces the macrophages’ pro-inflammatory phe- 
notype and enhances phagocytosis of amyloid-B’”. This is consistent 
with the suggestion that resolvins promote clearance of amyloid-B depo- 
sition to reduce inflammation in Alzheimer’s. Hence, SPMs might have 
homeostatic roles in brain and peripheral tissues, whereby each SPM 
has selective functions to reduce neuroinflammation. 

Inflammation can evoke persistent pain. ALX/FPR2 is expressed on 
spinal astrocytes, and local spinal delivery of synthetic LX A,, LXB, or 
their metabolically stable analogues reduces inflammation-induced 
pain’. Many SPMs dampen pain; in mice, they have specific targets 
of action’, as demonstrated with RvE1 and RvD1 for inflammatory 
pain involving both central and peripheral sites’. RVE1 administered 
intrathecally in mice is more potently analgesic than morphine ora 
COX-2 inhibitor. The RvE1 receptor ChemR23 is present in dorsal root 
ganglia, in which synthetic RvE1 regulates phosphorylation of ERK- 
dependent transient receptor potential vanilloid subtype-1 (TRPV1) 
inhibition and TNF-a-mediated hyperalgesia, centrally. In postsynap- 
tic neurons, synthetic RvE1 inhibits glutamate and TNF-a stimulation 
of N-methyl-p-aspartic acid (NMDA) receptor and mechanical allo- 
dynia™”. Synthetic RVD1, RvD2 and PD1/NPD1 each reduce pain by 
the inhibition of specific TRPV channels”. 

MaR1 inhibits TRPV1 in neurons and blocks capsaicin-induced 
inward current (the half-maximum inhibitory concentration is 
0.49 nM), diminishing inflammatory and chemotherapy-evoked neu- 
ropathic pain in mice™. Both aspirin-triggered RvD1 and 17R-HDHA 
reduce adjuvant-induced arthritis in rats and associated pain’, reduce 
NF-«B and COX-2 expression in the spinal cord, and, in arthritic joints, 
reduce TNF-a and IL-1 levels. In addition to leukocytes and microglia, 
SPM receptors are present on neuronal bodies, nerve terminals (skin 
and muscle) and synaptic terminals, on which they regulate specific 
transient receptor potential channels. For example, RVE1-ChemR23 
interaction in dorsal root ganglion regulates TRPV1, but not by direct 
activation of channels like endocannabinoids™ or other lipids that 
directly bind transient receptor potential channels; instead, each SPM 
activates a specific GPCR in the pico-nanomolar range to regulate chan- 
nels involved in pain signalling. 

Direct comparisons between synthetic LXA, and aspirin-triggered 
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RvD1 mechanical hypersensitivity in a rat model of inflammation- 
induced pain indicate that both effectively reduce hypersensitivity and 
pro-inflammatory mediators from astrocytes’®*. Local increases in 
pro-inflammatory mediators can result in cognitive decline, which can 
occur after major surgery or critical illness and is a major public health 
concern. Systemic aspirin-triggered RvD1 prophylaxis helps to prevent 
memory decline in mice that have undergone surgery, protecting them 
from post-operative neuronal dysfunction”. Whether SPM actions in 
mouse pain models translate to reducing human pain and improving 
cognition remains of interest. 


Towards human translation 
Because resolvins and protectins were identified in mouse exudates, it 
was essential to establish their biosynthesis by human leukocytes and 
in human tissues’*”* (Box 2). Using mass spectrometry, RvE1 and RvE2 
can be identified in human blood*”""”, provided EPA or DHA is avail- 
able. RvD1 and RvD2 are found in human plasma and serum’. The 
capacity of murine placenta to form resolvins (RvD1 and RvD2) and 
protectins (PD1 and 10S,17S-di HDHA"”, also known as PDx) was con- 
firmed by mass spectrometry, and the levels of each were increased by 
dietary omega-3 (ref. 113). Strategically, SPMs are also present in human 
breast milk!™*: LXA,, RvD1 and RvE1 were identified in milk of moth- 
ers in their first month of lactation using mass spectrometry’. These 
identifications, made possible with LC-MS-MS and the availability of 
SPMs, could provide opportunities for the rigorous assessment of SPM 
functional roles in human physiology and their potential for therapy. 
Evidence for lipid-mediator class switching in humans has recently 
been reported’’. LC-MS-MS lipidomic analysis of venous blood col- 
lected after strenuous resistance exercise identified lipid mediators in 
peripheral blood. Prostanoids in the initial post-exercise recovery phase 
were temporally followed by leukotrienes and p450-derived eicosa- 
noids (EETs), as well as lipoxins, resolvins and protectins. The NSAID 
ibuprofen, which is widely used for muscle aches and pains, blocked 
exercise-induced prostanoids, reduced LTB,, and both delayed and 
diminished the SPM response, as identified by mass spectrometry. This 
study of resistance exercise in humans illustrates the acute response of 
pro-inflammatory mediators, presumably from muscle, and the media- 
tors’ potential link to resolution programs''*. However, the amount of 
resolvins and protectins reported are relative, and additional studies 
are needed to establish their levels in healthy exercising individuals. 
Ina 60-patient double-blind trial’ of infants with eczema, topical 
15(R/S)-methyl-LXA, relieved severity and improved quality of life 
without apparent adverse events. In these infants, the lipoxin analogue 
was as effective as a topical steroid. 


Looking forward 

For autacoids, it is when and where that counts, and it is important to 
emphasize that the first response in acute inflammation is ubiquitous 
and mounts throughout the body. SPMs are agonists of resolution. Each 
stimulates cessation of neutrophil influx, efferocytosis and enhances 
phagocytosis for microbial containment — signs of resolution (Fig. 1). 
These steps are the defining SPM functions. Each SPM pathway has 
additional, non-redundant functions on target cell types. At the cel- 
lular and molecular levels, SPMs counter-regulate pro-inflammatory 
mediators (eicosanoids, chemokines, cytokines* 3 and adipokines”); 
regulate specific miRNAs” and cell traffic; and enhance microbial kill- 
ing by receptor-mediated mechanisms in animal models in vivo and 
with human neutrophils and macrophages’*”*”'. 

Results from preclinical disease models” suggest that treatment of 
inflammation-associated disease might be possible with SPM agonists 
that stimulate resolution and protect organs from collateral damage 
(Box 2). RVE1, MaR1 and NPD1/PD1 are each in clinical development 
programs. An RvE1 mimetic is in clinical trials for ocular indications 
(http://www.auventx.com/auven/products/rx10045.php), and NPD1/ 
PD1 is in clinical development for neurodegenerative diseases (http:// 
www.anidapharma.com/lead-molecule.html); given their ability to 
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regulate inflammation resolution without immunosuppression, it is 
hopeful they will become therapies. 

Because identifying SPMs in human tissues relies on LC-MS-MS- 
based approaches and internal standards that have only recently become 
available, at this point relatively few studies have demonstrated the pres- 
ence of SPMs in human tissues (such as blood, milk, and adipose and 
brain tissue). Hence, evidence for SPM formation in humans is at a 
very early stage, and their functional importance in human health and 
diseases remains to be established. Whether SPMs have physiological 
actions in target tissues in humans, now that we know that they are pro- 
duced in levels that show potent selective actions in animals’, can now 
be addressed (Box 2) — along with the impact of anti-inflammatories 
on SPM pathways — using LC-MS-MS-based lipid-mediator-SPM 
profiling of human tissues. The SPMs produced by human apoptotic 
neutrophils and macrophages can be identified using LC-MS-MS 
profiling’ without omega-3 EFA supplementation; it is therefore now 
possible, with the sensitivity of LC-MS-MS, to determine individual 
human SPM profiles (personalized lipid mediator-SPM metabolomics). 
This personalized metabolomics can be used to determine the relation- 
ship to resolution mechanisms. It is also important to determine, in 
healthy individuals, the omega-3 EFA supplementation and doses that 
might increase or diminish SPMs within specific tissues. Importantly, 
it is crucial to assess whether human diseases that are characterized 
by excessive inflammation result from failed resolution mechanisms 
because of the absence of specific SPM pathways and whether these 
mechanisms can be rescued either by EPA or DHA, or by therapeutic 
versions of SPM mimetics. Given that resolution of inflammation is 
a fundamental process in all organs, new approaches for stimulating 
resolution and using SPMs as pathway markers could soon be possible, 
as could determining the relationship of SPMs to nutrition in humans. 

SPMs have emerged from animal models as potential regulators in 

physiological pathways of resolution and unresolved inflammation that 
can affect infection, pain, obesity, organ protection” and inflamma- 
tory diseases (Box 2) beyond the roles of their omega-3 EFA precursors 
in intermediary metabolism and membrane dynamics. Identification 
of SPM bioactive metabolomes and an appreciation that exudates can 
drive resolution, partly through SPMs, sets a new terrain to evaluate 
resolution physiology and pharmacology, in which SPMs are crucial as 
chemical signals for catabasis and host defence. 
Note added in proof: A paper recently appeared online while the cur- 
rent Review was in press reporting the quantification of pro-resolving 
mediators using a new approach in human tissues and their functional 
relationship to known eicosanoids™. 
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The self-assembly of proteins into highly ordered nanoscale architectures is a hallmark of biological systems. The sophis- 
ticated functions of these molecular machines have inspired the development of methods to engineer self-assembling 
protein nanostructures; however, the design of multi-component protein nanomaterials with high accuracy remains an 
outstanding challenge. Here we report a computational method for designing protein nanomaterials in which multiple 
copies of two distinct subunits co-assemble into a specific architecture. We use the method to design five 24-subunit 
cage-like protein nanomaterials in two distinct symmetric architectures and experimentally demonstrate that their 
structures are in close agreement with the computational design models. The accuracy of the method and the number 
and variety of two-component materials that it makes accessible suggest a route to the construction of functional protein 


nanomaterials tailored to specific applications. 


The unique functional opportunities afforded by protein self-assembly 
range from the dynamic cellular scaffolding provided by cytoskeletal 
proteins to the encapsulation, protection and delivery of viral genomes 
to new host cells by virus capsids. Although natural assemblies can be 
repurposed to perform new functions'”, this strategy is limited to the 
structures of existing proteins, which may not be suited to a given appli- 
cation. To overcome this limitation, methods for designing novel self- 
assembling proteins are of considerable interest*~*. The central challenge 
in designing self-assembling proteins is to encode the information nec- 
essary to direct assembly in the structures of the protein building blocks. 
Although the complexity and irregularity of protein structures resulted 
in slow initial progress in this area, advances in computational protein 
design algorithms and new approaches such as metal-mediated assem- 
bly have recently yielded exciting results*'°. Despite these advances, the 
self-assembling protein structures designed so far have been relatively 
simple, and continued improvements in design strategies are needed in 
order to enable the practical design of functional materials. 

The level of structural complexity available to self-assembled nano- 
materials generally increases with the number of unique molecular com- 
ponents used to construct the material. This is illustrated by DNA 
nanotechnology, in which specific and directional interactions between 
hundreds of distinct DNA strands allow the construction of nanoscale 
objects with essentially arbitrary structures'”°. In contrast, designing 
well-ordered multi-component protein nanomaterials has remained a 
significant challenge. Multiple distinct intermolecular contacts are nec- 
essary to drive the assembly of such materials***""’, and programming 
new, geometrically precise interactions between proteins is difficult. 
Compared to homo-oligomers, multi-component protein nanomater- 
ials offer several advantages: a wider range of possible structures due to 
their construction from combinations of building blocks, greater control 
over the timing of assembly, and enhanced modularity through inde- 
pendently addressable subunits. Although multi-component protein 
assemblies have recently been generated using disulphide bonds'*”, flex- 
ible genetic linkers’*’*”’, or stereotyped coiled-coil interactions to drive 


assembly’*”*, the flexibility of these relatively minimal linkages has 
generally resulted in materials that are somewhat polydisperse. Most 
natural protein assemblies, on the other hand, are constructed from 
protein-protein interfaces involving many contacts distributed over 
large interaction surfaces that serve to precisely define the positions of 
the subunits relative to each other****, Advances in computational 
protein modelling and design algorithms have recently made it pos- 
sible to design such interfaces” and thereby direct the formation of 
novel self-assembling protein nanomaterials with atomic-level accu- 
racy’”’°, but the methods reported so far have been limited to the 
design of materials comprising only a single type of molec- 
ular building block. Here we expand the structural and functional range 
of designed protein materials with a general computational method for 
designing two-component co-assembling protein nanomaterials with 
high accuracy. 


Computational design method 

Our method centres on encoding the information necessary to direct 
assembly in designed protein-protein interfaces. In addition to pro- 
viding the energetic driving force for assembly, the designed interfaces 
also precisely define the relative orientations of the building blocks. 
We illustrate the method in Fig. 1 using the dual tetrahedral architec- 
ture (designated here as T33) as an example. In this architecture, four 
copies each of two distinct, naturally trimeric building blocks are aligned 
at opposite poles of the three-fold symmetry axes of a tetrahedron (Fig. 1a). 
This places one set of building blocks at the vertices of the tetrahedron 
and the other at the centres of the faces, totalling 12 subunits of each 
protein. Each trimeric building block is allowed to rotate around and 
translate along its three-fold symmetry axis (Fig. 1b); other rigid body 
moves are disallowed because they would lead to asymmetry. These four 
degrees of freedom are systematically explored during docking to iden- 
tify configurations with symmetrically repeated instances of a novel inter- 
building-block interface that is suitable for design (Fig. 1c). The score 
function used during docking favours interfaces with high densities of 
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Figure 1 | Overview of the computational design method. a, The T33 
architecture comprises four copies each of two distinct trimeric building blocks 
(green and blue) arranged with tetrahedral point group symmetry (24 total 
subunits; triangles indicate three-fold symmetry axes). b, Each building block 
has two rigid-body degrees of freedom, one translational (r) and one rotational 
(@), that are systematically explored during docking. c, The docking procedure, 
which is independent of the amino acid sequence of the building blocks, 
identifies large interfaces with high densities of contacting residues formed by 


contacting residues in well-anchored regions of the protein structure 
that are less likely to change conformation on mutation of surface side 
chains (Fig. 1d). RosettaDesign***’ is then used to sample the identities 
and configurations of the side chains near the inter-building-block 
interface, generating interfaces with features resembling those found 
in natural protein assemblies such as well-packed hydrophobic cores 
surrounded by polar rims” (Fig. le). The end result is a pair of new 
amino acid sequences, one for each building block, predicted to sta- 
bilize the modelled interface and drive assembly to the specific target 
configuration. 
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Figure 2 | Experimental characterization of co-assembly. a, SEC 
chromatograms of the designed pairs of proteins (solid lines) and the wild-type 
oligomeric proteins from which they were derived (dashed and dotted lines). 
The co-expressed designed proteins elute at the volumes expected for the target 
24-subunit nanomaterials, whereas the wild-type proteins elute as dimers or 
trimers. The T33-15 in vitro panel shows chromatograms for the individually 
produced and purified designed components (T33-15A and T33-15B) as well as 
a stoichiometric mixture of the two components. b, Native PAGE analysis of 
in vitro-assembled T32-28 (left panel) and T33-15 (right panel) in cell lysates. 
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well-anchored regions of the protein structure. The details of such an interface, 
boxed here, are shown in d. e, Amino acid sequences are designed at the 
new interface to stabilize the modelled configuration and drive co-assembly 
of the two components. f, In the T32 architecture, four trimeric (grey) and 
six dimeric (orange) building blocks are aligned along the three-fold and 
two-fold symmetry axes passing through the vertices and edges of a 
tetrahedron, respectively. 


These docking and design procedures were implemented by extend- 
ing the Rosetta software*'”’ to enable the simultaneous modelling of 
multiple distinct symmetrically arranged protein components. The new 
protocol allows the different components to be arranged and moved 
independently according to distinct sets of symmetry operators (Extended 
Data Fig. 1). This enables the design strategy described above to be gen- 
eralized to a wide variety of symmetric architectures in which multiple 
symmetric building blocks are combined in geometrically specific ways**”’. 
Combining even two types of symmetry elements (as in the present study) 
can give rise to a large number of distinct symmetric architectures with 
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Lysates containing the co-expressed design components (A1-tagged, lane 5; 
hexahistidine-tagged, lane 6) reveal slowly migrating species (‘24mers’, arrows) 
not present in lysates containing the wild-type or individually expressed 
components (lanes 1-4). Mixing equal volumes (e.v.) of crude lysates 
containing the individual designed components yields the same slowly 
migrating species (lane 7), although some unassembled building blocks remain 
due to unequal levels of expression (particularly for T33-15). When the 
differences in expression levels are accounted for by mixing adjusted volumes of 
lysates (a.v.), more efficient assembly is observed (lane 8). 
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a range of possible morphologies, including those with dihedral and 
cubic point-group symmetries, as well as helical, layer and space group 
symmetries (ref. 21 and T.O.Y., manuscript in preparation). 

In this study we targeted two distinct tetrahedral architectures: the 
133 architecture described above and the T32 architecture shown in 
Fig. 1f, in which the materials are formed from four trimeric and six 
dimeric building blocks aligned along the three-fold and two-fold tet- 
rahedral symmetry axes. We docked all pairwise combinations of a set 
of 1,161 dimeric and 200 trimeric protein building blocks of known 
structure in the T32 and T33 architectures (Supplementary Methods). 


ARTICLE 


Figure 3 | Designed interfaces of 
two-component protein 
nanomaterials. The models of the 
designed interfaces in each 
component (left, ‘A’ component; 
right, ‘B’ component) of T32-28 (a), 
133-09 (b), T33-15 (c), T33-21 (d) 
and T33-28 (e) are shown, and side 
views of each interface as a whole are 
shown at centre (see arrows 
indicating rotations at top). Each 
image is oriented such that a vector 
originating at the centre of the 
tetrahedral material and passing 
through the centre of mass of the 
designed interface would pass 
vertically through the centre of the 
image. The side chains of all amino 
acids allowed to change identity or 
conformation during the interface 
design procedure are shown in stick 
representation. The alpha carbon 
atoms of positions that were mutated 
during design are shown as spheres, 
and the mutations are labelled. To 
highlight the morphologies of the 
contacting surfaces, atoms within 5 A 
of the opposite building block are 
shown in semi-transparent surface 
representation. Oxygen atoms are 
red; nitrogen, blue; and sulphur, 
orange. 
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This resulted ina large set of potential novel nanomaterials: 232,200 and 
19,900 docked protein pairs, respectively, with a given pair often yield- 
ing several distinct promising docked configurations. Interface sequence 
design calculations were carried out on the 1,000 highest scoring docked 
configurations in each architecture, and the designs were evaluated on 
the basis of predicted binding energy, shape complementarity” and size 
of the designed interfaces, as well as the number of buried unsatisfied 
hydrogen-bonding groups (Supplementary Methods). After filtering 
on these criteria, 30 T32 and 27 T33 materials were selected for experi- 
mental characterization (Extended Data Fig. 2). The 57 designs were 
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derived from 39 distinct trimeric and 19 dimeric proteins, and contained 
an average of 19 amino acid mutations per pair of subunits compared to 
the native sequences. The designed interfaces resided mostly on elements 
of secondary structure, both a-helices and B-strands, with nearby loops 
generally making minor contributions. 


Screening and characterization of assembly state 


Synthetic genes encoding each designed pair of proteins were cloned in 
tandem in a single expression vector to allow inducible co-expression 
in Escherichia coli (Supplementary Methods). Polyacrylamide gel elec- 
trophoresis (PAGE) under denaturing and non-denaturing (native) con- 
ditions was used to rapidly assess the level of soluble expression and 
assembly state of the designed proteins in clarified cell lysates. For most 
of the designs, either one or both of the designed proteins was not de- 
tectable in the soluble fraction, suggesting that insoluble expression is 
a common failure mode for the designed materials. Given that the 
majority of the mutations introduced by our method are changes from 
polar to hydrophobic residues at the designed interfaces, it is likely that 
the insolubility of these designs is due to either misfolding or non-specific 
aggregation of the designed protein subunits. Nevertheless, several de- 
signed protein pairs yielded single bands under non-denaturing con- 
ditions that migrated more slowly than the wild-type proteins from 
which they were derived, suggesting assembly to higher-order species 
(Extended Data Fig. 3). These proteins were subcloned to introduce a 
hexahistidine tag at the carboxy terminus of one of the two subunits 
and purified by nickel affinity chromatography and size exclusion chro- 
matography (SEC). Five pairs of designed proteins—one T32 design 
(T32-28) and four T33 designs (T33-09, T33-15, T33-21 and T33-28)— 
eluted together during nickel affinity chromatography and yielded 
dominant peaks at the expected size of approximately 24 subunits when 
analysed by SEC (Fig. 2a and Supplementary Table 1). 

We tested the ability of each of the five materials to assemble in vitro 
by expressing the two components in separate E. coli cultures and mix- 
ing them at various points after cell lysis (Extended Data Fig. 3). Native 
PAGE revealed that in two cases (T33-15 and T32-28) the two sepa- 
rately expressed components efficiently assembled to give the designed 
materials in vitro when equal volumes of cell lysates were mixed (Fig. 2b, 
Extended Data Fig. 3a, c). Adjusting the volume of each lysate in the 
mixture to account for differences in the level of soluble expression of 
the two components allowed for more quantitative assembly. In the case 
of T33-15, the two components of the material could also be purified 
independently: T33-15A and T33-15B each eluted from the SEC col- 
umn as trimers in isolation. After mixing the two purified components 
in a 1:1 molar ratio and allowing a two-hour incubation at room tem- 
perature, the mixture eluted from the SEC column predominantly at 
the volume expected for the 24-subunit assembly, with small amounts 
of residual trimeric building blocks remaining (Fig. 2a). It is thus pos- 
sible to control the assembly of our designed materials by simply mix- 
ing the two independently produced components. 

The details of the designed interfaces for the five materials, high- 
lighting the shape and chemical complementarity generated by the many 
amino acid mutations introduced during design, are presented in Fig. 3. 
Qualitatively, the interfaces reflect the hypothesis underlying the design 
protocol: they feature well-packed and highly complementary cores of 
hydrophobic side chains residing mostly in elements of secondary struc- 
ture, surrounded by polar side chains lining the periphery of the hydro- 
phobic cores. The successful designs are quantitatively similar to the 
other designs according to the interface metrics used to select designs 
for experimental characterization (predicted binding energy, shape com- 
plementarity, interface size and number of buried unsatisfied hydrogen- 
bonding groups; Extended Data Fig. 4). The similarity of the successful 
and unsuccessful designs according to these structural metrics, combined 
with the observed insolubility of many of the designs, suggests that 
focusing on improving the level of soluble expression of the designed 
proteins could substantially improve the success rate of our approach 
in the future. 
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Figure 4 | Electron micrographs of designed two-component protein 
nanomaterials. Negative-stain electron micrographs for co-expressed and 
purified T32-28 (a), T33-09 (b), T33-15 (c), T33-21 (d) and T33-28 (e) are 
shown to scale (scale bar at top right, 25 nm). For each co-expressed material, 
two different class averages of the particles (top and bottom) are shown in the 
insets (left) alongside back projections calculated from the computational 
design models (right). f, Micrograph of a T33-15 sample prepared by 
stoichiometrically mixing the independently purified components (T33-15A 
and T33-15B) in vitro and purifying the assembled material by SEC (see Fig. 2). 
Micrographs of unpurified, in vitro-assembled T33-15 as well as T33-15A and 
T33-15B in isolation are shown in Extended Data Fig. 5. 


Structural characterization of the designed materials 
Negative-stain electron microscopy of the five designed materials con- 
firmed that they assemble specifically to the target architectures (Fig. 4). 
For each material, fields of monodisperse particles of the expected size 
and symmetry were observed, confirming the homogeneity of the mate- 
rials suggested by SEC. Particle averaging yielded images that recapitu- 
late features of the computational design models at low resolution. For 
example, class averages of T33-09 revealed roughly square or triangle- 
shaped structures with well-defined internal cavities that closely resem- 
ble projections calculated from the computational design model along 
its two-fold and three-fold axes (Fig. 4b, inset). Micrographs of T33-15 
assembled in vitro as described above were indistinguishable from 
those of co-expressed T33-15 (Fig. 4c, f and Extended Data Fig. 5c), 
demonstrating that the same material is obtained using both methods. 
We solved X-ray crystal structures of four of the designed materials 
(T32-28, T33-15, T33-21 and T33-28) to resolutions ranging from 2.1 
to 4.5 A (Fig. 5 and Supplementary Tables 2 and 3). In all cases, the 
structures reveal that the inter-building-block interfaces were designed 
with high accuracy: comparing a pair of chains from each structure to 
the computationally designed model yields backbone root mean square 
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deviations (r.m.s.d.) between 0.5 and 1.2A (Fig. 5 right and Extended 
Data Table 1). In the structures with resolutions that permit detailed 
analysis of side-chain configurations (T33-15 and two independent 
crystal forms of T33-21), 87 of 113 side chains at the designed inter- 
faces adopt the predicted conformations (Supplementary Tables 5 and 
6). As intended, the designed interfaces drive the assembly of cage-like 
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Figure 5 | Crystal structures of designed two-component protein 
nanomaterials. The computational design models (top) and X-ray crystal 
structures (bottom) are shown at left for T32-28 (a), T33-15 (b), T33-21 (c) and 
T33-28 (d). Views of each material are shown to scale along the two-fold and 
three-fold tetrahedral symmetry axes (scale bar at centre, 15 nm). The r.m.s.d. 
values between the backbone atoms in all 24 chains of the design models and 
crystal structures are indicated. For T33-21 (c), r.m.s.d. values are shown for 
both crystal forms (images are shown for the higher-resolution crystal form 
with backbone r.m.s.d. 2.0 A), while the r.m.s.d. range for T33-28 (d) derives 
from the four copies of the fully assembled material in the crystallographic 
asymmetric unit. At right, overlays of the designed interfaces in the design 
models (white) and crystal structures (grey, orange, green and blue) are shown. 
Owing to the limited resolution of the T32-28 structure, the amino acid side 
chains were not modelled beyond the beta carbon. For the interface overlays, 
the crystal structures were aligned to the design models using the backbone 
atoms of two subunits, one of each component. 


nanomaterials that closely match the computational design models: the 
backbone r.m.s.d. over all 24 subunits in each material range from 1.0 to 
26A (Fig. 5 left and Extended Data Table 1). The precise control over 
interface geometry offered by our method thus enables the design of 
two-component protein nanomaterials with diverse nanoscale features 
such as surfaces, pores and internal volumes with high accuracy. 


Discussion 


Owing to the unique functions accessible to self-assembling proteins, 
there is intense interest in engineering protein nanomaterials for appli- 
cations in various fields. Most efforts so far have focused on repurpos- 
ing naturally occurring protein assemblies, a strategy that is ultimately 
limited by the structures available and their tolerances for modification. 
Similarly, although directed evolution is a powerful method for protein 
engineering**”** and can be used to improve, for example, the packaging 
capability of existing protein nanocontainers**”,, it is difficult to envi- 
sion how it could accurately generate new protein nanomaterials with 
target structures defined at the atomic level. Our results demonstrate 
that computational protein design provides a general route for design- 
ing novel two-component self-assembling protein nanomaterials with 
high accuracy. The combinatorial nature of two-component materials 
greatly expands the number and variety of potential nanomaterials that 
can be designed. For example, in this study we used 1,361 protein build- 
ing blocks to dock over 250,000 distinct protein pairs in two target ar- 
chitectures with tetrahedral point group symmetry, resulting in a very 
large set of potential nanomaterials exhibiting a variety of sizes, shapes 
and arrangements of chemically and genetically addressable functional 
groups, loops and termini. With continued effort to increase the success 
rate of protein-protein interface design and reduce the rate of designed 
proteins that express insolubly, it should become possible to simulta- 
neously design multiple novel interfaces in a single material, which would 
enable the construction of increasingly complex materials built from 
more than two components. 

The conceptual framework that underlies our method—symmetric 
docking followed by protein-protein interface design—can be generally 
applied to a wide variety of symmetric architectures, including repet- 
itive protein arrays that extend in one, two or three dimensions. Multi- 
component materials are advantageous in these extended architectures 
because the uncontrolled self-assembly of a single-component material 
inside the cell can complicate biological production®"'”'. We have shown 
that the two components of the designed materials T32-28 and T33-15 
can be produced separately and mixed in vitro to initiate assembly of 
the designed structure. With new symmetric modelling algorithms cap- 
able of handling the additional degrees of freedom associated with these 
architectures, the accurate computational design and controllable assem- 
bly of complex, multi-component protein fibres, layers and crystals should 
also be possible. 

The capability to design highly homogeneous protein nanostructures 
with atomic-level accuracy and controllable assembly should open up 
new opportunities in targeted drug delivery, vaccine design, plasmonics 
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and other applications that can benefit from the precise patterning of 
matter on the subnanometre to 100-nanometre scale. Extending beyond 
static structure design, methods for incorporating the kinds of dynam- 
ic and functional behaviours observed in natural protein assemblies 
should make possible the design of novel protein-based molecular 
machines with programmable structures, dynamics and functions. 


METHODS SUMMARY 


The symmetric modelling framework in Rosetta*” was updated to enable the mod- 
elling of multi-component symmetrical structures. A new application, tedock, docks 
pairs of protein scaffolds in higher-order symmetries, scoring each docked config- 
uration according to its suitability for interface design. tcdock was used to dock all 
possible pairwise combinations of 200 trimeric scaffold proteins and all possible 
pairwise combinations of the same trimers and 1,161 dimeric proteins in the T33 
and T32 symmetric architectures, respectively. New two-component protein-protein 
interface design protocols were used to design new amino acid sequences predicted to 
stabilize selected docked configurations. During the sequence design protocols, 
the symmetric rigid body degrees of freedom and the identities and conformations 
of the side chains at the inter-building-block interfaces were optimized to identify 
low-energy sequence-structure combinations. Thirty T32 and 27 T33 designs were 
selected for experimental characterization. 

The assembly states of the designed pairs of proteins were assessed by native 
PAGE, and those that migrated more slowly than the wild-type scaffolds were sub- 
jected to affinity purification and SEC. The ability of the materials to assemble in vitro 
was investigated by independently producing the two components, mixing them 
at various points after cell lysis, and analysing the mixtures by native PAGE and 
SEC. The materials were structurally characterized by negative-stain electron micro- 
scopy and particle averaging, and at high resolution by X-ray crystallography. 
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Extended Data Figure 1 | Comparison of one-component and multi- 
component symmetric fold trees. Within the Rosetta macromolecular 
modelling suite, the connections between residues in a protein structure are 
represented as a directed, acyclic, graph referred to as a ‘fold tree*?**. When 
modelling multiple subunits in symmetric systems, the rigid body orientations 
of the subunits can be controlled by modifying the appropriate connections in 
the fold tree. In this work, we have extended Rosetta to allow multiple, 
independently managed connections from the fold tree to the subunits in the 
asymmetric unit (ASU) of the modelled structure. To demonstrate the new 
behaviour enabled by this change, three different symmetric fold tree 
representations of a D32 architecture are shown. In this architecture, which is 
used because of its relative simplicity, two trimeric building blocks (wheat) are 
aligned along the three-fold rotational axes of D3 point-group symmetry and 
three dimeric building blocks (light blue) are aligned along the two-fold 
rotational axes. a, The dimer-centric one-component symmetry case. 
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Rigid-body degree of freedom (RB DOF, black lines) Jp3 connecting the dimer 
subunit to the trimer subunit in the ASU is downstream of RB DOFs Jp, and Jp2 
controlling the dimer subunit; in this case the positions of the trimeric subunits 
depend on the positions of the dimeric subunits. b, The trimer-centric one- 
component symmetry case. RB DOF Jy3 connecting the trimer subunit to the 
dimer subunit in the ASU is downstream of RB DOFs Jy and J, controlling 
the trimer subunit; in this case the positions of the dimeric subunits depend on 
the positions of the trimeric subunits. c, The multi-component symmetry case. 
With multi-component symmetric modelling, the RB DOFs controlling the 
trimer subunit (J7, and Jy) and the dimer subunit (Jp; and Jp.) in the ASU are 
independent. In this case the positions of the dimeric subunits do not depend 
on the positions of the trimeric subunits and vice versa, allowing the internal 
DOFs for each building block (Jy2 and Jpz) to be maintained while moving the 
building blocks independently (Jr; and Jp;). See Supplementary Methods for 
additional discussion. 
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Extended Data Figure 2 | Models of the 57 designs selected for experimental 
characterization. Smoothed surface representations of each of the 30 T32 and 
27 T33 designs are shown. The trimeric component of each T32 design is shown 
in grey and the dimeric component in orange. The two different trimeric 
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components of each T33 design are shown in blue and green. The tetrahedral 
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two-fold and three-fold symmetry axes (black lines) are shown passing through 
the centre of each component. Each design is named according to its symmetric 
architecture (T32 or T33) followed by a unique identification number. The 

pairs of scaffold proteins from which the designs are derived are also indicated. 


Extended Data Figure 3 | Native PAGE analysis of cleared cell lysates. Each 
gel contains cleared lysates pertaining to a, T32-28, b, T33-09, c, T33-15, 

d, T33-21, or e, T33-28. Lane 1 is from cells expressing the wild-type scaffold 
for component A and lane 2 the wild-type scaffold for component B. Lanes 3 
and 4 are from cells expressing the individual design components and lanes 5 
and 6 the co-expressed components. Lanes 7 and 8 are from samples mixed 
as crude equal volume or crude adjusted volume (cr.e.v. or cr.a.v.) lysates, while 


ARTICLE 


<4 24mers 


Building 
| blocks 


lanes 9 and 10 are from samples mixed as cleared lysates (cl.e.v. or cl.a.v.). 
Lane 5 is from cells expressing the C-terminally A1-tagged constructs; all other 
lanes are from cells expressing the C-terminally His-tagged constructs. An 
arrow is positioned next to each gel indicating the migration of 24-subunit 
assemblies and the gel regions containing unassembled building blocks are 
bracketed. Each gel was stained with GelCode Blue. Portions of the gels in a and 
c are also shown in Fig. 2b. 
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Extended Data Figure 4 | Structural metrics for the computational design 
models. Selected metrics related to the designed interfaces are plotted for the 57 
designs that were experimentally characterized, including a, the predicted 
binding energy measured in Rosetta energy units (REU), b, the surface area 
buried by each instance of the designed interface, c, the binding energy density 
(calculated as the predicted binding energy divided by the buried surface area), 
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d, the number of buried unsatisfied polar groups at the designed interface, e, the 
shape complementarity of the designed interface, and f, the total number of 
mutations in each designed pair of proteins. Each circle represents a single 
design; the five successful materials are plotted as filled circles and labelled. In 
each plot, the designs are arranged on the x axis in order of increasing value of 
the metric analysed. 
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Extended Data Figure 5 | Electron micrographs of in vitro-assembled T33- _ T33-15B (b), as well as unpurified, in vitro-assembled T33-15 (c) are shown to 
15. Negative stain micrographs of independently purified T33-15A (a) and scale (scale bar at right, 25 nm). 
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Extended Data Table 1 | Root mean square deviations (r.m.s.d.) between crystal structures and design models 


Design model Crystal structure Global r.m.s.d. (A)* Two-chain r.m.s.d. (A)' Contents of asymmetric unit Structure used for superposition? 


T32-28 4NWN 2.586 1.246 One cage (24 subunits) Asymmetric unit 
T33-15 4NWO 1.433 0.876 One chain of each One cage generated from 
component (2 subunits) crystallographic 2- and 3-folds 
T33-21 ANWP. 1.962 0.924 4 chains of each One cage generated from 
component (8 subunits) one crystallographic 3-fold 
T33-21 4NWQ 1.482 0.765 One chain of each One cage generated from 
component (2 subunits) crystallographic 2- and 3-folds 
T33-28 4NWR 0.965 0.503 Four complete cages One complete cage from 
(96 subunits) the asymmetric unit 
T33-28 4NWR 0.965 0.548 Four complete cages One complete cage from 
(96 subunits) the asymmetric unit 
T33-28 4NWR 1.195 0.567 Four complete cages One complete cage from 
(96 subunits) the asymmetric unit 
T33-28 4NWR 1.212 0.477 Four complete cages One complete cage from 
(96 subunits) the asymmetric unit 


* Global backbone r.m.s.d. was calculated over all 24 subunits of each design model and corresponding subunits in each crystal structure. 

+Two-chain backbone r.m.s.d. was calculated over chains A and B of each design model and corresponding subunits in each crystal structure. 

£24 subunits composing one complete cage were derived from each crystal structure as indicated and the chains renamed to match the corresponding names in the design models. In the case of T33-28, four 
different sets of r.m.s.d. calculations were carried out, one for each of the four cages contained in the asymmetric unit of 4NWR. 
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Krishanu Mukherjee’, Tatiana P. Moroz, Yelena Bobkova', Fahong Yu’, Vladimir V. Kapitonov!”, Jerzy Jurka'’, Yuri V. Bobkov', 
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Jonathan V. Sweedler’, Evgeny I. Rogaev’”’*'®!”, Kenneth M. Halanych* & Andrea B. Kohn! 


The origins of neural systems remain unresolved. In contrast to other basal metazoans, ctenophores (comb jellies) have 
both complex nervous and mesoderm-derived muscular systems. These holoplanktonic predators also have sophisticated 
ciliated locomotion, behaviour and distinct development. Here we present the draft genome of Pleurobrachia bachei, 
Pacific sea gooseberry, together with ten other ctenophore transcriptomes, and show that they are remarkably distinct 
from other animal genomes in their content of neurogenic, immune and developmental genes. Our integrative analyses 
place Ctenophora as the earliest lineage within Metazoa. This hypothesis is supported by comparative analysis of multiple 
gene families, including the apparent absence of HOX genes, canonical microRNA machinery, and reduced immune com- 
plement in ctenophores. Although two distinct nervous systems are well recognized in ctenophores, many bilaterian 
neuron-specific genes and genes of ‘classical’ neurotransmitter pathways either are absent or, if present, are not expressed 
in neurons. Our metabolomic and physiological data are consistent with the hypothesis that ctenophore neural systems, 
and possibly muscle specification, evolved independently from those in other animals. 


Approximately 150 recognized species of comb jellies form a clade of 
pre-bilaterian animals’ ° (Fig. 1f) with an elusive genealogy, possibly 
tracing their ancestry to the Ediacaran biota*’. We selected the Pacific sea 
gooseberry, Pleurobrachia bachei (A. Agassiz, 1860; Figs 1a, 2, Extended 
Data Fig. 1, Supplementary Data 1 and Supplementary Videos 1 and 2) 
as a model ctenophore because of preserved traits thought to be ances- 
tral for this lineage (for example, cydippid larva and tentacles). Three 
next-generation sequencing platforms (454, Illumina and Ion Torrent) 
were used to obtain >700-fold coverage (Supplementary Tables 1 and 2) 
of Pleurobrachia’s genome, and about 2,000-fold coverage of the transcrip- 
tome representing all major organs and developmental stages (Supplemen- 
tary Tables 3 and 4). Consequently, the draft assembly was 156,146,497 
base pairs (bp) with 19,523 predicted protein-coding genes (Supplemen- 
tary Tables 5-7). About 90% of these predicted genes are expressed in at 
least one tissue or developmental stage (Supplementary Methods) and 
44% of Pleurobrachia genes have orthologues in other animals (Sup- 
plementary Tables 7 and 8). More than 300 families of transposable ele- 
ments constitute at least 8.5% of the genome (Supplementary Table 9 
and Supplementary Data 2) with numerous examples of diversification 
of some ancient transposable element classes (for example, transposases 
and reverse transcriptases). Approximately 1.0% of the genome is meth- 
ylated. Pleurobrachia also uses DNA demethylation during development, 
with both 5-hydroxymethyl cytosine (5-hmC) and its synthetic enzyme 


TET® (Extended Data Fig. 2). The obtained genome and transcriptome 
data provide rich resources (http://neurobase.rc.ufl.edu/Pleurobrachia) 
for investigating both animal phylogeny and evolution of animal inno- 
vations including nervous systems”*”?. 


Ctenophore phylogeny 

Although relationships among basal animal lineages are controversi 
our analyses (Supplementary Data 4) with Ctenophora represented by 
Pleurobrachia and Mnemiopsis suggest the placement of Ctenophora 
as the most basal animal lineage (Fig. 1 and Extended Data Fig. 3). Porifera 
was recovered as sister taxon to remaining metazoans (bootstrap sup- 
port = 100%) with Cnidaria sister to Bilateria (bootstrap support = 100%, 
Fig. 1f). Shimodaira—Hasegawa tests’” (corresponding to Extended Data 
Fig. 3a—c with 586 gene matrix) rejected both Eumetazoa (sponges sister 
group to all other metazoans) and Coelenterata (Cnidaria + Ctenophora). 
Placement of Ctenophora at the base of Metazoa also provides the most 
parsimonious explanation for the pattern of global gene gain/loss seen 
across major animal clades (Fig. 3a and Supplementary Table 14a, b). 
Transcriptome data from ten additional ctenophores (Supplementary 
Table 13) with stricter criteria for orthology inference (Supplementary 
Methods 7) also placed ctenophores basal, albeit with less support (Ex- 
tended Data Fig. 3d). When the most conserved set of genes was con- 
sidered (Supplementary Methods 7.5 and Supplementary Data 4.3), the 
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Figure 1 | Ctenophores and their 
innovations. a, The sea gooseberry, 
Pleurobrachia bachei 
(Supplementary Fig. 1), was selected 
as a target for genome sequencing 
due to preservation of traits ancestral 
for this lineage and because in situ 
hybridization/immunolabelling is 
possible. b-e, Major ctenophore 
innovations. b, Nervous system 
revealed by tyrosinated «-tubulin 
immunolabelling (scale bar, 60 1m). 
c, Scanning electron microscopy 
(SEM) imaging of nerve net ina 
tentacle pocket (scale bar, 20 tim). 
d, Locomotory ciliated combs (SEM; 
scale bar, 100 jim). e, Glue-secreting 
cells (colloblasts) in tentacles (SEM; 
scale bar, 20 um). f, Relationships 
among major animal clades with 
choanoflagellates sister to all 
Metazoa. 


topology was unresolved. Weak support is probably due to under- _ lack of other ctenophore and sponge genomes; Supplementary Methods 
representation of comparable transcriptomes from sponges and large _7.2) also rejected Coelenterata but not Eumetazoa. Notably, relation- 
protein divergence. Nevertheless, Shimodaira—Hasegawa tests basedon _ ships within Ctenophora were strongly supported (Fig. 2). Both cydip- 
expanded ctenophore sampling (with a reduced 114 gene matrix dueto _ pid and lobate ctenophores, previously viewed as monophyletic clades, 
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Figure 2 | Phylogenomic reconstruction among major ctenophore lineages. _ independent loss of both cydippid larval stage and tentacle apparatus as well as 
Cydippid (Euplokamis, Pleurobrachia, Dryodora and Mertensiidae) and lobate _ independent development of bilateral symmetry in benthic ctenophores, 
(Mnemiopsis and Bolinopsis) ctenophores were polyphyletic, suggesting Vallicula and Coeloplana (Supplementary Data 4). 


110 | NATURE | VOL 510 | 5 JUNE 2014 
©2014 Macmillan Publishers Limited. All rights reserved 


a 
Bilateria Ctenophores 
Pro Deu Cnidaria Placozoa Porifera Monosiga 
1,728 7,119 
709 Jf 
+Muscles -593 +Neurons 
° Miscles |e SS 'e cr a] eee ee 
+Neurons -4,952 
+Hox cal AL 
+miRNAs 7,771 
+Collagen 
Gap junctions 


Immunity (Toll/Nod) 


ARTICLE 


b Pleurobrachia 
10,899/19,523 


Capsaspora 


6,083 


Amphimedon e287 
12,050/30,060 aye 


2,894 


Trichoplax 
18,702/27,416 


Die 2,591 


10,537 5,803 


7,208 


Nematostella 
15,257/27,273 


Figure 3 | Gene gain and gene loss in ctenophores. a, Predicted scope of gene 
loss (blue numbers; for example, —4,952 in Placozoa) from the common 
metazoan ancestor. Red and green numbers indicate genes shared between 
bilaterians and ctenophores (7,771), as well as between ctenophores and other 
eukaryotic lineages sister to animals, respectively. Text on tree indicates 


were recovered polyphyletic, indicating independent loss of both the 
cydippid larval stage and tentacle apparatus. Interestingly, Platyctenida 
was the second basal-most branch in the Ctenophore clade, suggesting 
that their benthic and bilaterial nature are secondarily derived. 

A highly reduced complement of animal-specific genes is a feature 
shared for the entire ctenophore lineage (Fig. 3 and Supplementary 
Table 15). HOX genes involved in anterio-posterior patterning of body 
axes and present in all metazoans are absent in ctenophores and sponges'® 
(Supplementary Tables 17 and 18). Likewise, canonical microRNA 
machinery (that is, Drosha and Pasha, Supplementary Table 19) is lack- 
ing in Pleurobrachia and other ctenophores. Using small RNA sequencing 
from Pleurobrachia, Bolinopsis and Beroe, we were unable to experi- 
mentally detect microRNAs (Supplementary Data 5.4). Pleurobrachia 
also lacks major elements that initiate innate immunity such as pattern 
recognition receptors (Toll-like, Nod-like, RIG-like, Ig-Toll-IL-1R domain) 
and immune mediators, MyD88 and RHD transcription factors, that are 
present in bilaterians, cnidarians and, in divergent forms, in sponges’?”° 
(Supplementary Table 20). 

Key bilaterian myogenic/mesoderm-specification genes are absent 
in Pleurobrachia’s genome and transcriptomes of ten other ctenophores 
(Supplementary Table 35). These data indicate that muscles” and, pos- 
sibly, mesoderm evolved independently in Ctenophora to control the 
hydroskeleton, body shape and food capture. Thus, ctenophores might 
have independently developed complex phenotypes and tissue organ- 
ization, raising questions about the nature of ctenophore-specific traits 
such as their unique development, combs, tentacles, aboral/apical organs 
and nervous systems. 


Ctenophore innovations 

To assess the genomic bases of ctenophore-specific innovations, we per- 
formed RNA-seq profiling of the major developmental stages (Fig. 4a, b) 
as well as adult organs and identified genes differentially expressed in 
these structures. Many Pleurobrachia genes that have no homologues 
in other species are specifically expressed and most abundant during the 
4- to 32-cell cleavage stages as well as in tentacles, combs and the aboral 
organ (Fig. 4b and Extended Data Fig. 4). Thus, structures that are known 
as ctenophore innovations (Fig. 1d, e) have the largest complement of 
highly expressed Pleurobrachia/ctenophore-specific genes. These data 
suggest extensive gene gain in cell lineages associated with early segre- 
gation of developmental potential leading to ctenophore-specific traits 
in structures controlling feeding, locomotion and integrative functions; 
a finding consistent with proposed ‘orphan’ genes contributing to vari- 
ation in early development and evolution of novelties***. 

Examples of known metazoan gene families that are considerably 
expanded in Ctenophora (Supplementary Data 5 and Supplementary 
Table 16) include collagens, RNA-editing enzymes and RNA-binding 
proteins (Supplementary Data 5). Pleurobrachia’s genome encodes the 


emergence of complex animal traits and gene families. Deu, deuterostomes; 
Pro, protostomes. b, Uniquely shared and lineage-specific genes among basal 
metazoans. Values under species names indicate number of genes (bold) 
without any recognized homologues (e-value is 10” *) versus the total number 
of predicted gene models in relevant species (Supplementary Table 14b). 


highest number of RNA-editing enzymes (ADARI-4, ADAT1-3, CDA1-2) 
reported so far among metazoans**” (Supplementary Data 5.5), possibly 
acting as the generalized mechanism generating post-transcriptional 
diversity and ctenophore-specific traits in locomotory and integrative 
structures (combs and aboral organ). Matching the expansion of RNA 
regulatory mechanisms, Pleurobrachia has more RNA-binding proteins 
(especially RRM/ELAV, KH and NOVAs”*”’, Supplementary Table 21) 
than any basal metazoan or choanoflagellate examined. Dozens of RNA- 
binding proteins are selectively expressed and abundant during 8-64-cell 
stages (Supplementary Table 31), and might contribute to sequestration 
of RNAs and segregation in developmental potential leading to early 
cell-fate specification. 

Phenotypic complexity positively correlates with presence and high 
differential expression of 92 homeodomain Pleurobrachia genes (Sup- 
plementary Data 5.2 and Supplementary Table 17)—76 genes have been 
reported in Mnemiopsis'*, whereas the Amphimedon homeodomain 
complement consists of only 32 genes. In contrast, some developmental 
pathways are either absent (Hedgehog, JAK/STAT) or have reduced rep- 
resentation in ctenophores (TGF-B, Wnt, Notch). Surprisingly, most 
Wnt genes are weakly expressed during Pleurobrachia development, 
whereas the ctenophore-specific subtype WntX is primarily restricted 
to adult neuroid elements such as polar fields, aboral organ and ten- 
tacular conductive tracts (Extended Data Fig. 5e), suggesting a distinct 
molecular make-up of neural systems. 


Parallel evolution of neural organization 

Extensive parallel evolution of neural organization in ctenophores is the 
most evident. Compared to other animals with nervous systems, many 
genes controlling neuronal fate and patterning (for example, neurogenins, 
NeuroD, Achaete-scute, REST, HOX, Otx) are absent in the ctenophores 
that we sampled. Orthologues of pre- and postsynaptic genes also have 
limited representation (Supplementary Table 34), and they lack com- 
ponents (for example, neuroligin) critical for synaptic function in other 
eumetazoans. 

Notably, our combined molecular, ultrasensitive metabolomic, immu- 
nohistochemical and pharmacological data strongly suggest that cteno- 
phores do not use serotonin, acetylcholine, dopamine, noradrenaline, 
adrenaline, octopamine, histamine or glycine as intercellular messengers 
(Extended Data Figs 6 and 7g, Supplementary Data 5.8 and Supplemen- 
tary Tables 22-26). Lack of ionotropic receptors for these molecules in 
ctenophores is consistent with this conclusion (Supplementary Table 26a). 
Most genes involved in the synthesis of neurotransmitters are absent 
in non-metazoan opisthokonts Monosiga and Capsaspora, suggesting 
that they are cnidarian/bilaterian innovations. 

We next investigated the identity of the ctenophore transmitters. Phys- 
iological and pharmacological tests suggest that L-glutamate is a candi- 
date neuromuscular transmitter in Pleurobrachia (Fig. 5b, Extended Data 
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Figure 4 | Nature of ctenophore innovations. a, Main developmental stages 
in Pleurobrachia from eggs to cleavage (2-64 cells), gastrulation (1-3 h) and 
formation of cydippid larvae (~24h). b, Hierarchical clustering of 
approximately 400 ctenophore-specific genes differentially expressed among 
different development stages and adult structures as revealed by RNA-seq 
experiments. Colour index: black indicates highest level of expression, followed 
by purple, red then down to white indicating no expression. Most of these 
ctenophore-specific genes are primarily expressed during 4-32-cell stages 
(asterisks). The red circle indicates a subset of novel genes uniquely expressed in 
combs, tentacles and the aboral organ (AO). These genes lack recognized 
homologues in other organisms. c, Diversity and differential expression of 
RNA-editing genes in Pleurobrachia development and adult tissues (RNA-seq). 
ADARI has the highest expression level in early cleavage stages whereas 


Fig. 7), able to induce rapid inward currents and raise intracellular Ca”* 
concentration in muscle cells causing muscle contractions at nanomolar 
concentrations (10° ” M). In contrast, all other classical neurotransmit- 
ters were ineffective even in concentrations up to 5 X 10° * M whereas 
D-glutamate as well as L-/D-aspartate have significantly less affinity in 
these assays (Fig. 5b). 

The proposed role of glutamate as a signal molecule in ctenophores 
is supported by an unprecedented diversity of ionotropic glutamate 
receptors (iGluRs; Extended Data Fig. 7a, b and Supplementary Table 27)— 
far exceeding the number of genes encoding iGluRs in other basal 
metazoans**. iGluRs might have undergonea substantial adaptive radi- 
ation in Ctenophora as evidenced by unique exon/intron organization 
for many subtypes, and ctenophoran iGluRs form a distinct clade within 
the gene tree. Notably, during development, Pleurobrachia’s neurons 
are formed 2 days after the initial muscle formation, and first neurogen- 
esis events correlate with co-expression of all iGlu receptors in hatching 
larvae (Fig. 4d). All cloned iGluRs also show remarkable cell-type- 
specific distribution with predominant expression in tentacles, followed 
by combs and the aboral organ, revealing well-developed glutamate sig- 
nalling in adults (Extended Data Fig. 7b). Additionally, Pleurobrachia 
contains more genes for glutamate synthesis (8 glutaminases) and trans- 
port (8 sialins) than any other metazoan investigated”*”*. Although we 
detected GABA (y-aminobutyric acid, Supplementary Tables 22-24) 
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ADAR2 and ADAR3 and ADATI and ADAT2 are most abundant in the combs. 
d, Morphological appearance of neurons during the third day of development 
(the top inset, neuronal cell bodies are stained with tyrosinated «-tubulin 
antibodies, red arrows) correlates with abundant expression of multiple iGluR 
receptors, suggesting that glutamate has an important role as an intercellular 
messenger. Muscles formed well before neuronal differentiation at the end of 
the first day of development (the bottom insert, phalloidin staining, yellow 
arrow); white arrow points to the embryonic mouth with hundreds of cilia 
inside. In cand d expression levels of RNA-editing or iGluR genes are shown as 
a normalized frequency of sequence reads for a given transcript from all RNA- 
seq data for each developmental stage (Supplementary Methods). Scale bars in 
d: 15 jum (top); 40 um (bottom). 


and immunohistochemically identified its localization in muscles, lack 
of pharmacological effects of GABA on Pleurobrachia behaviour and 
major motor systems, such as cilia, muscle and colloblasts, suggest that 
GABA is a by-product of glutamate metabolism by L-glutamic acid 
decarboxylase. 

The first nervous systems are suggested to be primarily peptidergic 
in nature’. Although we did not find any previously identified neuro- 
peptide homologue, secretory peptide prohormone processing genes 
(Supplementary Table 31) are present. We predicted 72 novel putative 
prohormones in Pleurobrachia and found >50 homologues in other 
sequenced ctenophores (Extended Data Fig. 8 and Supplementary Tables 
28 and 32). Functions of these prohormone-derived peptides could include 
cell-to-cell signalling, toxins, involvement in innate immunity, or a com- 
bination of these functions. Several ctenophore-specific precursors are 
expressed in polarized cells around the mouth, tentacles and polar fields, 
indicative of a signalling role (Extended Data Fig. 8b). Secreted products 
of these prohormones may be natural ligands for >100 orphan neuro- 
peptide-like G-protein-coupled receptors*’ identified in Pleurobrachia 
(Supplementary Table 26b). A second example of neuropeptide receptor 
candidates is amiloride-sensitive sodium channels (ASIC), which are 
also known to be regulated by different classes of short peptides and 
protons”. Pleurobrachia’s genome has 29 genes encoding ASICs—more 
than any organism sequenced so far—and expression of most correlated 
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Figure 5 | Emergence of neural organization in Pleurobrachia. a, Two 
neural nets in Pleurobrachia as revealed by tyrosinated o-tubulin 
immunostaining. Top image shows subepithelial net with concentrations of 
neuronal elements in the polar fields and ciliated furrows, known as structures 
involved in sensory and motor functions, respectively (blue arrow in right insert 
indicates location of a neuronal somata with individual neurites marked by 
red arrows). The bottom image shows neurons of the mesogloeal net (arrows 
are neuronal somata; arrowheads are neuronal processes). Note that phalloidin 
(a muscle marker) did not stain these cells. Scale bar, 120 um (top); 10 um 
(bottom images). b, L-glutamate (107-107? M) induced action potentials in 
muscle cells, whereas other transmitter candidates were ineffective even at 
concentrations up to 5 mM. Typical responses of ctenophore muscle cells to 
local pulses of a transmitter application were recorded both as individual action 
potentials (whole-cell current-clamp mode) and video contractions from a 
single muscle cell. The graph shows normalized responses from the same 
muscle cell, indicating that L-glutamate is the most potential excitatory 


with developmental appearance of neurons (Supplementary Table 31). 
ASIC expression is most abundant in tentacles, combs and aboral organs— 
structures enriched in neural elements and under complex synaptic control. 

Moreover, ctenophores evolved an enormous diversity of electrical 
synapses (absent in Nematostella, Amphimedon and Trichoplax) with 
12 gap junction proteins (pannexins/innexins”* but not chordate-specific 
connexins) found in Pleurobrachia. All pannexins/innexins have their 
highest expression in the aboral organ followed by tentacles and combs 
(Fig. 5d). Non-metazoans lack pannexin orthologues, suggesting that 
these are metazoan innovations with profound expansion of this family 
in ctenophores. The aboral organ, combs and tentacles have a relatively 
large diversity of ion channels (Extended Data Fig. 9b), confirming com- 
plex regulation of excitability in these structures. However, the overall 


molecule compared to D-glutamate or L-/D-aspartate (Supplementary 
Methods). ACh, acetylcholine. c, Key molecular innovations underlying neural 
organization in ctenophores. Bars indicate the presence or relative expansions 
of selected gene families in all basal metazoan lineages from the inferred 
urmetazoan ancestor. The data suggest that sponges and placozoans never 
developed neural systems, or, unlikely assuming the presence of neuronal 
organization in the urmetazoan ancestor, sponges and placozoans lost their 
nervous systems. Either hypothesis points towards extensive parallel evolution 
of neural systems in ctenophores versus the Bilateria+ Cnidaria clade. d, The 
aboral organ has the greatest diversity and highest expression levels of 12 gap 
junction proteins, suggesting unmatched expansion of electrical signalling in 
this complex integrative organ—an analogue of an elementary brain in 
ctenophores. Expression of different innexins is shown as a summation of 
normalized frequencies of respective sequencing reads in RNA-seq data 
obtained from each developmental stage and adult tissues (y axis is “expression 
frequency or ‘expression’; see Supplementary Methods). 


complement of voltage-gated ion channels in ctenophores is reduced 
compared to other eumetazoans** (Extended Data Fig. 9a). 

Our genome-wide survey also indicates that some bilaterian and 
cnidarian pan-neural markers are present (for example, three elav and 
musashi genes), but they are not expressed in neurons; a finding con- 
sistent with early divergence and extreme parallel evolution of neural 
systems in this lineage (Extended Data Figs 5 and 9b). 


Discussion 

Figure 5c summarizes key molecular innovations underlying neural orga- 
nization in ctenophores. Evidently, with a markedly different molecular 
and genomic make-up, ctenophores have achieved complex phenotypic 
plasticity and tissue organization. Thus, ctenophores might represent 
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remarkable examples of convergent evolution including the emergence 
of neuro-muscular organization from the metazoan common ancestor 
without differentiated nervous system or bona fide neurons (Extended 
Data Fig. 10b and Supplementary Data 15). The alternative ‘single- 
origin hypothesis’, where the common ancestor of all metazoans had a 
nervous system with complex molecular and transmitter organization 
including all classical cnidarian/bilaterian transmitters and neurogenic 
genes (Extended Data Fig. 10a), is a less parsimonious scenario. This 
hypothesis implies that ctenophores, despite being active predators, under- 
went massive loss of neuronal and signalling toolkits and then replaced 
them with novel neurogenic and signalling molecules and receptors. 

These findings are of relevance for regenerative and synthetic bio- 
logy in designing novel signalling pathways and systems. In this case, 
ctenophores and their genomes present matchless examples of “exper- 
iments’ in nature and the possible preservation of ancient molecular 
toolkits lost in other animal lineages. 


METHODS SUMMARY 


Genome sequencing and transcriptome profiling used high-throughput methods. 
Gene prediction (Augustus/Fgenesh) and annotations were complemented by meta- 
bolome and microchemical assays using capillary electrophoresis with laser-induced 
fluorescence and mass spectrometry detectors to validate the presence/absence of 
selected transmitters. RNA-seq, PCR with reverse transcription (RT-PCR), in situ 
hybridization and immunohistochemistry were used to characterize and localize gene 
expression in adults and embryos. Pharmacological and electrophysiological tests 
were performed on semi-intact preparations using freshly isolated ctenophore cells. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Source material. Animals (Pleurobrachia bachei, Euplokamis dunlapae, Dryodora 
glandiformis, Beroe abyssicola, Bolinopsis infundibulum and Mertensiid) were col- 
lected at Friday Harbour Laboratories (Pacific North-Western Coast of USA) and 
maintained in running seawater for up to 2 weeks. Other species were collected at 
the Atlantic coast of Florida and around Woods Hole, Massachusetts (Pleurobrachia 
pileus, Pleurobrachia sp., Mnemiopsis leidyi) as well as central Pacific (Palau, Hawaii, 
Coeloplana astericola, Vallicula multiformis). Animals were anaesthetized in 60% 
(volume/body weight) isotonic MgCl, (337 mM). Specific tissues were surgically 
removed with sterile fine forceps and scissors and processed for DNA/RNA isola- 
tions as well as metabolomics or pharmacological/electrophysiological tests. Whole 
animals were used for all in situ hybridization and immunohistochemical tests as 
described**. Genomic DNA (gDNA) was isolated using Genomic-tip (QIAGEN) 
and total RNA was extracted using RNAqueous-Micro (Ambion/Life Technology) 
or RNAqueous according to manufacturers’ recommendations. Quality and quan- 
tity of gDNA was analysed on a Qubit2.0 Fluorometer (Life Technologies) and for 
RNA we used a 2100 Bioanalyzer (Agilent Technologies). For all details see Supplemen- 
tary Methods sections 1.1-1.3. 

Genome sequencing. All genomic sequence data for de novo assembly were gen- 
erated on Roche 454 Titanium and Illumina Genome Analyzer IIx, HiSeq2000 and 
MiSeq instruments using both shotgun pair-end and mate-pair sequencing librar- 
ies with 3-9 kb inserts as summarized in Supplementary Tables 1 and 2. Shotgun 
sequencing was performed from a single individual. Owing to a limited amount of 
starting gDNA, mate pair libraries were constructed from 10-12 individuals. In 
total, the genome sequencing is composed of 132,015,600,107 bp or ~132 Gb of 
data, which corresponds to 733-825 physical coverage of the Pleurobrachia genome 
(the size of the P. bachei genome is estimated to be ~ 160-180 Mb); see Supplemen- 
tary Methods sections 1.4-2.1.2. 

Genome assemblies. The Pleurobrachia bachei draft genome was assembled using 
a custom approach designed to leverage the individual strengths of three popular 
de novo assembly packages and strategies: Velvet**, SOAPdenovo”, and pseudo- 
454 hybrid assembly with ABySS**. First, using filtered and corrected data, we per- 
formed individual assemblies from 454 and Illumina reads by the Newbler (Roche, 
Inc.) software. Then the merged/hybrid assembly was achieved using three indivi- 
dual assemblies (SOAPdenovo, Velvet and ABySS/Newbler as described in Supplemen- 
tary Methods 2.2). Three gene model predictions were performed by Augustus” 
and Fgenesh predictions with the Softberry Inc. Fgenesh+ + pipeline“ to incor- 
porate information from full-length cDNA alignments and similar proteins from 
the eukaryotic section of the NCBI NR database”. After initial gene predictions in 
each of the three sets of genomic scaffolds, we screened each set of gene models for 
internal redundancy with the BLASTP program from NCBI’s BLAST + software 
suite’. A model was considered redundant if it: had 90% identity to other model; 
the alignment between the two models hada bit score of at least 100; and the model 
was shorter than the other model. 

Scaffolds producing these gene models were pooled and then screened for pro- 
karyotic contamination using UCSC’s BLAT software package“ to produce the draft 
genome assembly version 1.0 (statistics can be found in Supplementary Table 5 and 
Supplementary Methods 2). 

Genome annotation. For annotation, gene models were uploaded to the In- VIGO 
BLAST interface, a blastp alignment of gene models was performed against the entirety 
of NCBI’s non-redundant protein database and the Swiss-Prot protein database, 
and subsequently annotated in terms of Gene Ontology and KEGG pathways as 
well as Pfam domain identification. Transposable elements (TEs) were identified 
using not only WU-BLAST and its implementation in CENSOR but also databases 
for all known classes, superfamilies and clades of TEs described in the literature 
and/or collected in Repbase**. Detected sequences have been clustered based on their 
pairwise identities by using BLAST clust. All autonomous non-LTR retrotranspo- 
sons have been classified based on RT class] (ref. 46). To merge partially predicted, 
non-redundant gene models with assembled transcriptome data, a custom Java tool 
was developed. This Java tool extended partial gene model predictions based on 
using transcriptome sequences to bridge 5’ and 3’ fragments of partially predicted 
genes. Using this Java tool, analysis of alignments of non-redundant gene models to 
assembled Pleurobrachia transcriptomes resulted to 19,523 (Supplementary Table 30) 
gene models. These gene models were used to also identify their possible homo- 
logues in assembled transcriptomes from 10 other ctenophore species sequenced 
(Supplementary Tables 10 and 11). All genomic sequences were submitted to NCBI 
on SRA accession number Project SRP001155 (Supplementary Methods 3.1-3.2). 
Transcriptome sequencing and annotation. Three sequencing technology plat- 
forms were used for transcriptome profiling (RNA-seq): Roche 454 Titanium, 
Illumina HiSeq2000 and Ion Proton/PGM (Ion Torrent, Life Technologies). RNA- 
seq was performed from all major embryonic and developmental stages (1 cell, 2 
cells, 4 cells, 8 cells, 16 cells, 32 cells, 64 cells, early and later gastrula, 1 day and 3 day 
larvae), major adult tissues and organs (combs, mouth, tentacles, stomach, the aboral 
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organ, body walls), and whole body of Pleurobrachia bachei. We developed a 
reduced representation sequencing protocol for the 454 and Ion Torrent sequenc- 
ing platforms that can detect low abundance transcripts’’. The method reduces 
the amount of sequencing and gives more accurate quantification and additional 
details of the procedure are reported elsewhere*”’. In summary, we have generated 
499,699,347 reads or ~47.9 Gb to achieve approximately 2,000 coverage of the 
Pleurobrachia transcriptome. 

In addition, Illumina HiSeq sequencing was also performed with RNA extracted 
from the following ctenophore species: Euplokamis dunlapae, Coeloplana aster- 
icola, Vallicula multiformis, Pleurobrachia pileus, Pleurobrachia sp. (collected from 
the Middle Atlantic and later identified as a subspecies of P. pileus), Dryodora 
glandiformis, Beroe abyssicola, Mnemiopsis leidyi, Bolinopsis infundibulum and an 
undescribed species which belongs to the family Mertensiidae (Supplementary Table 3). 
Each sequencing project was individually assembled using the Trinity de novo assembly 
package” and in selected cases using MIRA. Reads from developmental stages were 
also assembled using the CLCBio Genomics Workbench. Before each assembly, reads 
were quality trimmed and had adaptor contamination removed with cutadapt™. Full 
summaries of the transcriptome assemblies are presented in Supplementary Tables 4 
and 10. Each transcriptome was mapped to the Pleurobrachia genome, and aligned 
to both NCBI’s non-redundant protein database (NR) and the UniProtKB/Swiss- 
Prot (SP) protein database. Gene Ontology” and Kyoto Encyclopedia of Genes and 
Genomes’ (KEGG) terms were associated with each transcript. By first trans- 
lating transcripts in all six reading frames, Pfam/SMART domains™ were assigned 
to each reference transcriptome. 

Each reference transcriptome and its full set of annotation and expression data 
was uploaded to our transcriptome database http://moroz.hpc.ufl.edu/slimebase2/ 
browse.php for downstream analysis and visualization®”**. The database is inte- 
grated with UCSC type genome browser. Via the genome project homepage (http:// 
neurobase.rc.ufl.edu/Pleurobrachia) all data sets have direct download options. Quan- 
tification of gene expression profiling was performed on all transcriptional data as 
described in Supplementary Methods 4.4. Hierarchical clustering was performed 
by Spotfire agglomerative algorithm. All primary transcriptome data was submit- 
ted to NCBI on SRA accession number Project SRP000992. (See Supplementary 
Methods 4.1-4.2.3 for details.) 

Phylogenetic analyses. To reconstruct basal metazoan phylogeny (see controver- 
sies in'°-'°°”), we conducted two sets of phylogenomic analysis using tools described 
elsewhere*. All analyses included new data from Pleurobrachia bachei and the 
sponges Sycon (Calcarea) and Aphrocallistes (Hexactinellida). For the first set of ana- 
lyses, Ctenophora was represented by two species of Pleurobrachia and Mnemiopsis 
leidyi. Initial analyses included the taxa in Supplementary Table 12. For a subsequent 
analysis, sampling within Ctenophora was expanded to include ten additional taxa, 
each represented by a relatively deeply sequenced Illumina transcriptome (Supplemen- 
tary Table 13). In order to reduce noise in the phylogenetic signal, we used strict 
criteria to exclude paralogues, highly derived sequences, mistranslated sequence 
regions, and ambiguously aligned positions in sequence alignments. Analyses were 
conducted in RAxML 7.2.7 (refs 59) using maximum likelihood (ML) with the 
CAT +WAG + F model. Topological robustness (that is, nodal support) for all ML 
analyses was assessed with 100 replicates of nonparametric bootstrapping. Details 
of phylogenomic analyses are presented in Supplementary Methods 7. Shimodaira— 
Hasegawa test’” was implemented in RAxML with the PROTGAMMAWAGEF model”. 

In order to examine evolution of single genes or gene families, alignments were 
performed with either ClustalX2° © or Muscle then, if appropriate, either trimmed 
manually or trimmed using GBlocks™ to exclude ambiguously aligned positions. 
Once alignments were obtained, gene trees were reconstructed in MEGA 5 using 
ML with the Whelan and Goldman (WAG) model. The bootstrap consensus tree 
was inferred from 100 replicates. All positions containing gaps and missing data 
were eliminated. Pfam composition™, Gene Ontology*' and KEGG**** were used 
to further validate P. bachei orthologues. Analyses of gene gain and gene loss were 
performed using custom scripts as described elsewhere® and in Supplementary 
Methods 7. 

Analysis of DNA methylation. ELIZA-based colorimetric assays (Epigenteck) 
were performed to quantify both global 5-mC and 5-hmC methylation in the P. 
bachei genome. A total of 6 individual P. bachei and three rat (positive control) 
were used (Supplementary Methods 1.2). Three biological and technical replicates 
were performed for every sample. Absolute quantification of 5-mC and 2-hmC 
were determined and date is reported as a mean = s.e.m. (Supplementary Methods 8). 
Molecular cloning, in situ hybridization and immunohistochemistry. Methods 
were similar as reported elsewhere**”**” with some modifications (Supplemen- 
tary Methods 9-11). 

Scanning electron microscopy. Animals were fixed in 2.5% glutaraldehyde in 
0.2 M phosphate-buffered saline (pH 7.6) for 3-4 h at room temperature, and washed. 
For secondary fixation, we used 2% osmium tetroxide in 1.25% sodium bicarbonate 
for 2-3 h at room temperature. After dehydration in ethanol, samples were placed 
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for drying in Samdri-790 Critical Point Drying. After drying the samples were coated 
on Sputter Coater. SEM observations and recordings were done on NeuScope JCM- 
5000 microscope (Supplementary Methods 12). 

Electrophysiological methods, calcium imaging and pharmacological assays. 
Patch electrodes for extracellular and whole-cell recordings were pulled from boro- 
silicate capillary (P-87, Sutter Instruments). All currents were recorded using an 
Axopatch or 200B amplifier controlled by a Digidata 1322A and pClamp 9.2. Action 
potentials (APs, spikes) were recorded in track mode using cell-attached loose-patch 
configuration. Whole-cell currents were recorded in voltage-clamp mode at a hold- 
ing potential of —70 mV. Neurotransmitter candidate (see Supplementary Method 
15) application for both extracellular AP and whole cell recordings were performed 
with a rapid solution changer, RSC-160 (Bio-Logic-Science Instruments, France). 
Data were analysed with Clampfit 9.0 (Molecular Devices) in combination with 
SigmaPlot 10.0. Videomicroscopy and time-lapse series were acquired with QImaging 
EXi CCD camera using DIC mode of Nikon Eclipse 2000 inverted microscope. Cal- 
cium imaging was performed on isolated ctenophore muscle cells using an Olympus 
IX-7linverted microscope equipped with a cooled CCD camera (ORCA R2, Hama- 
matsu). Cells were injected with calcium-sensitive probe (Fluo-4, ~5 1M) through 
a patch pipette. Fluorescence imaging was performed under the control of Imaging 
Workbench 6 software. Stored time series image stacks were analysed off-line using 
Imaging Workbench 6, Clampfit 10.3, SigmaPlot 10/11 or exported as TIFF files 
into Image] 1.42. Pharmacological tests and behavioural assays with video record- 
ing were performed on intact animals in 5-401 aquaria or on semi-intact prepara- 
tions in a Sylgard-coated Petri dish with free cilia beating and muscle contractions. 
To monitor and quantify cilia movements we used glass microelectrodes filled with 
2M potassium acetate with resistances of 5-20 MQ with electrical signals recorded 
by A-M System amplifiers (Neuroprobe 1600) and Gould Recorder (WindoGraf 980). 
Determination of the presence of classical neurotransmitters by capillary 
electrophoresis (CE). Two CE separation techniques were used to analyse tissue 
extracts for the presence of a number of neurotransmitters (Supplementary Tables 22 
and 23; Supplementary Methods 17). While both methods used CE separations, 
complimentary detection methods, laser-induced native fluorescence (LINF)* and 
electrospray ionization mass spectrometry (ESI-MS) were used to ensure broad 
coverage and low detection limits for the specific analytes of interest. Whole bodies 
of small animals as well as individual organs and tissues were removed, rinsed with 
ultrapure water and analytes were extracted using 49.5/49.5/1, methanol (LC-MS 
grade)/water/glacial acetic acid (99%) by volume, homogenized, centrifuged and 
supernatant was removed and frozen at —80 °C until analysis. The CE-LINF instru- 
ment used ultraviolet excitation at 264 nm and the native fluorescence emission was 
collected and recorded using a UV-enhanced CCD array (Spec-10; 2KBUV/LN; 
Princeton Instruments). CE separations were performed by hydrodynamic injection 
of 10 nl of sample and using 25 mM citric acid (pH 2.5, applied voltage +30 kV) or 
50 mM borate (pH 9.5, applied voltage +21 kV). Analytes were identified based on 
comparison of both the migration time and fluorescence spectrum to that of stan- 
dard mixtures of analytes. CE-ESI-MS analysis was performed using a Bruker Microtof 
or a Maxis (Bruker Daltonics) mass spectrometer for detection. All separations 
were performed using 1% formic acid in water as the electrolyte and applied voltage 
of +30kV. Sheath liquid was 0.1% formic acid in 50/50 methanol/water. Samples 
were hydrodynamically injected for a total volume of ~6 nl. Mass spectra were col- 
lected and recorded at a rate of 2 Hz with calibration performed using sodium formate 
clusters. Analytes were identified based on comparison of both the CE migration 
time and mass match to that of standard mixtures of analytes. 
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Extended Data Figure 1 | Anatomy and neuromuscular systems of the 
ctenophore Pleurobrachia bachei. a—-e, Anatomy of the ctenophore, 
Pleurobrachia bachei A. Agassiz, 1860. Natural colouration of the major organs 
in live animal is shown. a, Details of the transparent Pleurobrachia body are 
shown including, b, the pharynx and tentacle sheaths (pockets). Eight rows of 
comb plates, called ctenes, are made of giant compound cilia that diffract light, 
creating iridescence. c, Comb rows in Pleurobrachia are constantly beating. 
The mouth and the aboral organ (AO) are located at the opposite poles of the 
animal (a, c). The AO controls complex coordinated behaviours of the animal. 
d, Ciliated furrows connect the AO and the ctenes to mediate behaviour. 

e, Tentacles have numerous contractile tentillae used to capture food with 


specialized glue cells or colloblasts (see also Fig. le). f-h, Pleurobrachia neural 
nets and muscles. f, Comb plate muscles (red) were revealed using in situ 
hybridization for B-tubulin and subepithelial neural net (green) revealed by 
tyrosinated «-tubulin immunostaining. g, In this image comb cilia (green) were 
stained using tyrosinated o-tubulin antibodies (green) whereas underlying 
comb plate muscles were visualized by phalloidin (a muscle marker) that did 
not stain neurons. h, Organization of the subepithelial neural net around the 
mouth as revealed by tyrosinated o-tubulin antibodies (whole-mount 
preparation). Scale bars: 120 um (f); 100 Lum (g); 200 um (h). See 
Supplementary Methods 10 and 11. 
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Extended Data Figure 2 | DNA methylation and active DNA demethylation 
in Pleurobrachia bachei. CpG DNA methylation facilitates the elimination of 
CpG dinucleotides over evolutionary time®’. a, Histogram shows relative 
occurrences of different dinucleotides in genomes of P. bachei (red bars), 
Drosophila melanogaster (green bars, no DNA methylation) and Homo sapiens 
(blue bars). The P. bachei genome contains 2.3% CpG dinucleotides, which is 
much lower than the expected random frequency and, therefore, indicative 
of a genome that undergoes methylation compared to humans®. b, DNMT 
genealogy tree. The enzyme DNA methyltransferase (DNMT), which catalyses 
transfer of a methyl group to DNA to form 5- methyl cytosine (5-mC), is 
present in Pleurobrachia. c, TET family of enzymes catalyses active DNA 


demethylation via formation of 5-hydroxymethyl cytosine (5-hmC, the 6th 
DNA base). RNA-seq profiling reveals differential expression for DNMT and 
TET-like genes during development and in adult P. bachei. Both DNMT 

and TET-like genes are predominantly expressed during cleavage starting from 
the 1st division. However, the TET-like gene is also highly expressed in adult 
combs (asterisk). The y axis shows a normalized expression level for each 
transcript. d, ELIZA-based colorimetric assays validate the presence of both 
5-mC and 5-hmC in the P. bachei genome (the rat brain is used as a positive 
control; n = 6 for Pleurobrachia and n = 3 for rat; data shown as mean = s.e.m., 
see Supplementary Methods 8 and Supplementary Data section 3 for details). 
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Extended Data Figure 3 | Animal phylogeny. a, Phylogeny of Metazoa based 
on 586 genes. Topology inferred using RAxML 7.2.7 and maximum likelihood 
with the CAT +WAG + F model with all taxa from Supplementary Table 12. 
Bootstrap support values are listed at each node. Colour coding: purple, 
ctenophore; yellow, Porifera; pink, Cnidaria; light blue, Bilateria. b, Removal 
of fast-evolving taxa Trichoplax and Caenorhabditis improves topological 
robustness. Topology inferred using RAxML 7.2.7 and maximum likelihood 
with the CAT +WAG + F model with all taxa from Supplementary Table 12 
except Trichoplax and Caenorhabditis. Bootstrap support values are listed at 
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each node. c, Removal of distant out-groups such as Fungi and Filasterea 
further improves topological robustness. Topology inferred using RAxML 7.2.7 
using maximum likelihood with the CAT +WAG + F model with all taxa 
from Supplementary Table 12 except Trichoplax, Caenorhabditis and 
non-choanoflagellate out-groups. Bootstrap support values are listed at each 
node. d, Analysis with improved ctenophore taxon sampling based on 114 
genes. Topology inferred using RAXML 7.2.7 using maximum likelihood with 
the CAT +WAG + F model with all taxa from Supplementary Table 13. 
Bootstrap support values are listed at each node. 
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Extended Data Figure 4 | Identification of tentacle- and comb-specific 
transcripts. a, Identification of tentacle-specific transcripts. The left photo 
shows SEM of a Pleurobrachia tentacle with two branching tentillae densely 
covered with hundreds of colloblasts or glue cells. Comparative transcriptome 
(RNA-seq) profiling among major organs allowed us to identify several dozen 
genes differentially or uniquely expressed in tentacles. The histogram shows 
illustrative examples of some of these genes with a normalized expression level 
(y axis) for each represented transcript. One of these Pleurobrachia-specific 
genes we named tentillin (green arrow). In situ hybridization experiments 

(n = 9) revealed a remarkable cell-specificity expression pattern for tentillin in 
all main tentacle branches and tentillae, possible labelling colloblasts or 


associated secretory cells. Scale bar: 50 um. b, Identification of comb-specific 
transcripts. The left photo shows a microscopic image of one comb row from 
an intact animal. The natural colouration is a reflection of the iridescence 
patterns produced from large cilia forming combs. Comparative transcriptome 
(RNA-seq) profiling among major organs allowed us to identify several 
hundreds of genes differentially or uniquely expressed in combs. The histogram 
shows illustrative examples of some of these genes with a normalized 
expression level (y axis) for each represented transcript (see Supplementary 
Methods 4.2.3.6, 4.2.3.7 and 10, all sequences used in the analysis can be found 
in Supplementary Tables 29, 30 and 32). 
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Extended Data Figure 5 | Illustrative examples of tissue-specific gene 
expression in Pleurobrachia bachei. a, b, Dicer and Argonaut are 
predominantly expressed in structures associated to sensory and integrative 
functions. These include the aboral organ, polar fields and combs. Note a 
relatively weak staining of other cell types in the skin and following ciliated 
furrows in Dicer and Argonaut preparations. c, d, Pleurobrachia ELAV is 
expressed in combs and not in neurons. ELAVs are RNA-binding proteins and 
they are considered as pan-neuronal markers (see Supplementary Data 5.6.1). 
However, in Pleurobrachia ELAV expression has not been detected in 

neural tissues or cells with recognizable neuronal-like appearances. In situ 
hybridization for Pleurobrachia ELAV3 (c, d) shows the highest levels of 
expression in the adult comb plate but not in any of the neural tissues or organs 


Aboral Organ 


= 


enriched with neurons such as the aboral organ and polar fields. e, Wntx is 
selectively expressed in the aboral organ (AO) and major conductive pathways 
of Pleurobrachia, indicating its involvement in integrative and neural-like 
functions (in situ hybridization on a whole-mount preparation). One of the 
highest WntX expressions is found in AO and ciliated furrows whereas the 
polar fields showed a moderate expression level associated to their central 
regions. In situ hybridization was performed on whole mounts using 
DIG-labelled probes (see details in the Supplementary Methods, all in situ 
hybridization were performed at least on 4-5 different animals and these 
are representative photos for these experiments). Scale bars: 500 tm (a-d); 
800 um (e). 
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Presence of these analytes were confirmed with mass spectrometry 


Extended Data Figure 6 | Absence of serotonin in ctenophores. Here we 
used nanolitre volume sampling, capillary electrophoresis separation and 
wavelength-resolved native fluorescence detection as described for 
ultra-sensitive assay of 5-hydroxytryptamine (serotonin or 5-HT) and related 
metabolites (a, the top electropherogram and the table with standards used). 


LOD, amoles 
a- Serotonin (5-HT) 50 
b - Tryptophan 9 
c- Sulphorhodamine 101 
d- 5-Hydroxyindole acetic acid 25 
e- 3-Indole butyric acid 130 
f- 3-ildole acetyc acid 60 


The hemichordate Saccoglossus bromophenolosus 
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Time (min) 


Limits of detection (LODs) range from the low attomole to the femtomole 
range, with 5-HT LODs being approximately 20-50 attomoles. b, c, Using this 
assay we failed to detect 5-HT in Pleurobrachia (b, n = 6) but 5-HT was reliably 
detected in the hemichordate Saccoglossus (c) and molluscs®. See details in 
Supplementary Methods 17 and Supplementary Table 22 for quantification. 
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Extended Data Figure 7 | L-glutamate as a transmitter candidate in 
Pleurobrachia bachei. a, The ionotropic glutamate receptors (iGluRs) are 
diverse and underwent substantial adaptive radiation within the Ctenophora 
lineage. Phylogenetic analysis shows Pleurobrachia iGluRs share highest 
identity to each other forming a distinct branch on the tree topology 
(Supplementary Data 5.9). b, Differentially expressed iGluR subtypes in 
Pleurobrachia bachei (red and green labelling with fluorescent in situ 
hybridization protocols). Dark blue fluorescence is DAPI nuclear staining. 
AO, aboral organ. Scale bars: 100 tum (top left and top middle); 60 um (top 
right); 50 pm (bottom left); 30 um (bottom middle); 200 jm (bottom right). 
c-f, Glutamate-induced action potentials and currents in muscle cells. 

c, Typical responses of ctenophore muscle cells to glutamate pulses recorded 


25 30 
Migration time, min 


30 40 


extracellularly (as individual action potentials/contractions from a single 
muscle cell in response to local application of glutamate, 1 mM), and d, from 
the same cell in whole-cell current clamp mode with clear action potentials. 
e, Isolated muscle cell. Scale bar, 25 tm. f, Glutamate-activated whole-cell 
currents recorded from the same cell (as in c). Time course of application is 
depicted by the diagram below the voltage signal. Two responses (inward 
current) are shown. The holding potential was —70 mV (Supplementary 
Methods 13-16). g, Representative electropherograms show capillary 
electrophoresis separation with laser-induced fluorescence detection from 
different organs in Pleurobrachia bachei (n = 5) for transmitter candidate 
identification. The bottom electropherograms are standards (Supplementary 
Methods 17 and Supplementary Tables 23-25 for quantification). 
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Extended Data Figure 8 | Quest for putative secretory molecules in 
ctenophores. a, Computational pipeline for prediction of secretory products in 
Pleurobrachia and the overview of secretory products predicted from the 
Pleurobrachia gene models (Supplementary Method and Data sections 4.2.3.7 
and 5.11, respectively). b, Expression of novel secretory molecules in 
ctenophores (DIG-labelled in situ hybridization, Supplementary Data 5.11 and 
Supplementary Methods 10). Each of the predicted secretory prohormone was 
selected based on its unique and/or highly differential expression pattern as 
revealed by RNA-seq profiling. Ctenophorin is uniquely expressed in polarized 
cells around the mouth of Pleurobrachia and we found its homologues in all 
ctenophore species we sequenced. Tentillin is a Plerobrachia-specific gene, 
which is uniquely expressed in polarized secretory-like cells in tentillae and 
tentacles. Jansonin’s expression is primarily restricted to polarized cells located 


ARTICLE 


in the aboral organ and polar fields. For comparison, we showed a different but 
also cell-specific expression pattern of BarX transcription factor in cells of 
unknown identify localized in polar fields, comb plates and tentacles (see b4). 
c, d, Most predicted secretory products are expressed later in development and 
in adult organs of Pleurobrachia (RNA-seq). ¢, Expression patterns of 72 
predicted prohormones in P. bachei indicates that 20 of them are present and 
differentially expressed in development (Supplementary Table 32 for all 
Pleurobrachia precursor sequences). Surprisingly 5 of these precursor mRNAs 
were found starting from the second cleavage stage whereas the rest are 
predominantly expressed on day 3 of development. This correlates to the first 
appearance of neurons in Pleurobrachia cydippid larva (see Supplementary 
Data 5.11 and Supplementary Method section 4.2.3.6 for the RNA-seq 
analysis). 
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Extended Data Figure 9 | Diversity of ion channels in Pleurobrachia bachei. 
a, Metazoan ion channel complement. The 112 ion channels identified in the 
Pleurobrachia genome are classified as voltage gated (v) or other gating such 
as second messengers. Receptor channels (R) are ligand-gated or ionotropic 
(iGluR, ChRN, HTR3, GABA and CLR) and indicated in grey. Metazoan 
novelties indicate type of ion channels absent in the choanoflagellates, the sister 
group to all animals. Coloured squares show channels: (1) primarily absent in 
ctenophores (pink); (2) secondarily lost in sponges or placozoans (dark yellow); 
(3) eumetazoan (Cnidaria+ Bilareria) innovations (blue); or (4) examples of 
expansion of certain class of channels in some animal lineages (red). All 


Pleurobrachia sequences used in the analysis can be found in Supplementary 
Table 31. b, Ion channels are predominantly expressed in tentacles, combs and 
aboral organ. Hierarchical clustering of 112 identified ion channels in 
developmental stages and adult tissues of Pleurobrachia. Adult organs involved 
in food capture and ciliated locomotion and integrative functions show 
significantly higher diversity and overall higher level of expression levels for 
most of ion channel types. Mobile tentacles had the highest expression of 
voltage-gated channels, in particular Ca, and Na,. The legend shows relative 
expression levels based on RNA-seq data (see Supplementary Methods 4.2.3.6). 
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miR-34/449 miRNAs are required for 
motile ciliogenesis by repressing cp110 


Rui Song'*, Peter Walentek*, Nicole Sponer!*, Alexander Klimke*, Joon Sub Lee’, Gary Dixon!, Richard Harland’, Ying Wan", 


Polina Lishko!, Muriel Lize>, Michael Kessel® & Lin He! 


The mir-34/449 family consists of six homologous miRNAs at three genomic loci. Redundancy of miR-34/449 miRNAs 
and their dominant expression in multiciliated epithelia suggest a functional significance in ciliogenesis. Here we report 
that mice deficient for all miR-34/449 miRNAs exhibited postnatal mortality, infertility and strong respiratory dysfunc- 
tion caused by defective mucociliary clearance. In both mouse and Xenopus, miR-34/449-deficient multiciliated cells 
(MCCs) exhibited a significant decrease in cilia length and number, due to defective basal body maturation and apical 
docking. The effect of miR-34/449 on ciliogenesis was mediated, at least in part, by post-transcriptional repression of 
Cp110, a centriolar protein suppressing cilia assembly. Consistent with this, cp110 knockdown in miR-34/449-deficient 
MCCGs restored ciliogenesis by rescuing basal body maturation and docking. Altogether, our findings elucidate conserved 
cellular and molecular mechanisms through which miR-34/449 regulate motile ciliogenesis. 


MicroRNAs (miRNAs) encode a class of small, non-coding RNAs that 
regulate gene expression through post-transcriptional repression’ ~. 
Although the initial discovery of miRNAs was made through classic 
forward genetics in worm development”, loss-of-function studies on 
most individual miRNAs yield no overt developmental defects in mul- 
tiple organisms, suggesting strong functional redundancy among homol- 
ogous miRNAs”*. Redundant miRNAs can be generated from multiple 
genomic loci or transcribed from a single polycistronic precursor. Col- 
lectively, these redundant miRNAs could constitute the majority of 
expressed miRNAs in specific cell types”'®. Such extensive homology 
and dominant cell-type specific expression of a single miRNA family 
could confer a robust functional readout that can only be revealed by 
complete removal of redundant miRNAs. 

mir-34/449 miRNAs constitute a conserved family in vertebrates 
comprising three genomic loci, mir-34a, mir-34b/34c and mir-449c/449b/ 
449a (mir-449), which encode six homologous miRNAs (miR-34a, 34b, 
34c, 449a, 449b and 449c)'*"> (Fig. 1a and Extended Data Fig. 1a). Sequence 
homology among miR-34/449 miRNAs, particularly at the seed region, 
predicts robust functional redundancy. miR-34/449 miRNAs are highly 
enriched in mucociliary epithelia that contain motile cilia’®, which beat 
coordinately to generate fluid movement'®*””. Structural and functional 
defects in motile cilia are associated with a human syndrome, primary 
cilia dyskinesia (PCD)'*"”. Here we demonstrate that miR-34/449-deficient 
mice developed PCD-like respiratory and fertility phenotypes. Con- 
sistently, miR-34/449 deficiency in mice and frogs disrupts ciliogenesis 
in mucociliary epithelia, causing reduced cilia length and number due 
to impaired basal body maturation and apical docking. This is, at least 
in part, mediated by direct miR-34/449 repression of Cp110, a centriolar 
protein suppressing cilia assembly'*’. These findings reveal conserved 
cellular and molecular mechanisms underlying the functions of miR- 
34/449 in MCC ciliogenesis. 


11-13 
> 


PCD-like phenotype in mir-34/449 TKO mice 
To characterize mir-34/449 functions, we generated triple knockout 
(TKO) mice deficient for all mir-34/449 loci (mir-34a, mir-34b/34c and 


mir-449)° (Extended Data Fig. 1b, c). Although mir-449 resides in intron 
2 of cdc20b"', mir-449 deletion in mice does not negatively affect cdc20b 
expression (data not shown). TKO mice were born at a Mendelian ratio 
with normal body weight (Fig. 1b and Extended Data Fig. 1d); yet exhib- 
ited frequent postnatal mortality with only ~40% surviving to adult- 
hood (Fig. 1b). TKO mice also exhibited growth attenuation with ~50% 
lower body weight than littermate-controlled double knockout (DKO) 
mice (mir-34a /"; mir-34b/34c/~ or miR-34a ‘~; mir-449/~, Fig. 1c 
and Extended Data Fig. 1d). 

Surviving TKO mice showed severe respiratory distress character- 
ized by frequent coughing and sneezing (Extended Data Fig. le and 
Supplementary Video 1). Dying and surviving TKO mice displayed res- 
piratory dysfunction, with excessive mucus accumulation in the paranasal 
cavities and increased susceptibility to respiratory infections (Fig. ld and 
Extended Data Fig. 1f). Littermate-controlled mir-34a /~ 3 mir-34b/ 
34c /~ or mir-34a ‘; mir-449-‘~ DKO mice phenotypically resem- 
bled wild-type mice, without obvious developmental or respiratory defects 
(Fig. 1d and Extended Data Fig. 1f, g). 

Unlike phenotypically normal DKO controls (mir-34a"‘~; mir-449°/~ 
or mir-34a /; mir-34b/34c / ~), surviving TKO males and females were 
infertile, generating no pregnancies when mated with wild-type animals. 
TKO males exhibited defective spermatogenesis during differentiation 
from elongating spermatids to spermatozoa (Fig. leand Extended Data 
Fig. 2a, b), when flagella formation occurs'’. TKO females exhibited a 
decrease in epithelial ciliation of the fallopian tube, presumably causing 
defects in oocyte transport’” (Fig. le and Extended Data Fig. 2a, c). 
Altogether, the TKO mouse phenotype resembled symptoms of a subset 
of PCD patients, exhibiting predominant respiratory and fertility defects 
without hydrocephaly or left-right asymmetry defects”* (Extended 
Data Fig. 2d). 


Ciliogenesis defects in mir-34/449 TKO mice 
Mature miR-34/449 miRNAs were enriched in organs containing motile 


cilia, including lung, brain, testis and female reproductive organs (Extended 
Data Fig. 3a). We specifically detected and quantified individual miR-34/449 
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Figure 1 | mir-34/449 TKO mice exhibit defective mucociliary airway 
clearance and infertility. a, Gene structure (top) and sequence alignment 
(bottom) of mouse miR-34/449 miRNAs. Red box indicates seed sequences. 
b, TKO mice exhibit frequent postnatal mortality shown as log-rank test. 

c, Surviving TKO mice display postnatal growth retardation. d, Excessive 
mucus accumulation and infection in paranasal sinuses of dying TKO at P7 


miRNAs using single knockout and TKO controls (Fig. 2a and Extended 
Data Fig. 3b, c). In situ hybridization revealed high-level miR-34/449 
expression in respiratory epithelia, with miR-34a being expressed broadly 
in multiple cell types, and mir-34c or miR-449c being enriched specif- 
ically in airway MCCs (Fig. 2a and Extended Data Fig. 3d). 

A major symptom of PCD is dysfunctional airway clearance’*”’. 
Defective mucociliary clearance in mir-34/449 TKO mice, along with the 
MCC-specific miR-34/449 expression, prompted us to examine the roles 
of miR-34/449 in airway MCCs. High-speed imaging revealed a slow 
and limited fluid movement in TKO tracheal explants, accompanied 
by a significant reduction of visibly ciliated MCCs (Fig. 2b and Sup- 
plementary Video 2). This contrasts the effective anteriorward fluid flow 
in wild-type and DKO tracheal explants (Fig. 2b and Supplementary 
Videos 2 and 3). 

The decrease of visible MCCs in TKO tracheas could reflect defective 
cell fate specification or ciliogenesis. We analysed mir-34a‘~; mir- 
449-'~ DKO and TKO tracheas for Foxj1, a master regulator of motile 
ciliogenesis™, and acetylated «-tubulin (Ac-a-tub), a cilia marker. Both 
DKO and TKO tracheas had Foxjl immunofluorescence staining and 
Foxj1 mRNA levels comparable to wild-type controls (Fig. 2c and Ex- 
tended Data Fig. 3e). Nevertheless, a portion of Foxj1-positive cells lacked 
cilia in TKO tracheas, yet most Foxj1-positive cells were fully ciliated in 
DKO and wild-type tracheas (Fig. 2c, and data not shown). This sug- 
gests normal cell fate specification with defective ciliation in mir-34/449- 
deficient MCCs. Scanning electron microscopy supported this finding, 
revealing a significant reduction in cilia length and number per MCC 
in TKO, but not in DKO and wild-type tracheal epithelia (Fig. 2d and 
Extended Data Fig. 3f). Notably, TKO MCCs displayed a spectrum of 
ciliation phenotypes (non-ciliated, partially or fully ciliated; Fig. 2d), pos- 
sibly due to the mixed genetic background. 

MCC ciliogenesis is characterized by the multiplication of basal 
bodies, which, after docking to the apical membrane, act as microtubule- 
organizing centres to assemble motile axonemes””®. We stained TKO 
tracheas with antibodies against Ac-o.-tub and y-tubulin (y-tub) to visualize 
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(top) and surviving adult TKO mice (bottom). Arrow, infection; arrowhead, 
mucus accumulation; n = 15. e, Adult TKO males and females are infertile. 
Although early spermatids (Sd) are developed, few intact spermatozoa (Sz) are 
generated (top, n = 3). A significant MCC reduction is observed in TKO 
fallopian tubes (bottom, n = 3). 
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Figure 2 | miR-34/449 deficiency causes ciliogenesis defects in respiratory 
MCCs. a, miR-34/449 are strongly enriched in MCCs of respiratory epithelia 
(arrowheads), shown by in situ hybridization (n = 2). b, TKO tracheal epithelia 
exhibit defective mucociliary clearance demonstrated by live imaging of 
fluorescent bead transport. Red, visibly ciliated MCCs; n = 4. ¢, Cell-fate 
specification of MCCs is unaffected in TKO tracheas. Immunofluorescence 
staining for Foxjl (a MCC marker) is unaltered in TKO tracheas, yet a large 
number of Foxjl-positive MCCs (arrowhead) have decreased staining for 
Ac-o-tub (a cilia marker). n = 3. d, TKO tracheal MCCs have a significant 
reduction in cilia number and length, revealed by scanning electron microscopy 
(n = 3). 
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Figure 3 | miR-34/449 deficiency causes defective basal body docking in 
mouse airway MCCs. a, TKO tracheas exhibit ciliation defects, shown by 
immunofluorescence staining for Ac-c-tub (cilia) and -tubulin (basal bodies). 
Numbers on the image indicate fully (1), partially (2) or non-ciliated (3) MCCs; 
n = 3. b-d, Basal bodies fail to dock to the apical membrane of TKO MCCs. 


cilia and basal bodies of MCCs, respectively. Consistent with previous 
observations, Ac-c-tub staining was greatly decreased in TKO tracheas, 
yet the percentage of y-tub-enriched MCCs remained normal (Fig. 3a 
and Extended Data Fig. 4a, b). Air—liquid interface (ALD) culture of pri- 
mary TKO tracheal epithelia yielded a similar observation (Extended 
Data Fig. 4c). Thus, basal body multiplication occurred normally in 
miR-34/449-deficient MCCs following MCC cell fate specification, yet 
cilia formation was impaired. 


Impaired basal body docking in mir-34/449 TKO MCCs 
Basal body docking to the apical MCC membrane is essential for proper 
ciliogenesis**”°. In mir-34a /~; mir-34b/34c_/- DKO MCCs, y-tub 
staining was apically localized, indicating normal basal body docking 
(Fig. 3b). In contrast, y-tub staining was diffuse in TKO tracheal MCCs 
and ALI culture, suggesting defective basal body docking to or stabiliza- 
tion at the apical membrane (Fig. 3b and Extended Data Fig. 4d). Trans- 
mission electron microscopy revealed well-aligned basal bodies at the 
apical membrane of wild-type and DKO MCCs (Extended Data Fig. 4e). 
Yet in TKO MCCs, a significant percentage of basal bodies were mis- 
localized to the cytoplasm and unable to grow cilia, and those apically 
docked generally formed shorter cilia (Fig. 3c, d). The extent of ciliation 
defects correlated well with basal body docking defects, suggesting aber- 
rant basal body docking as a key mechanism for impaired ciliation (Fig. 3d). 
Defective ciliation and basal body docking in TKO MCCs also corre- 
lated with a disturbed apical actin organization (Extended Data Fig. 4f). 

Despite defective basal body docking in TKO MCCs, the structural 
components of basal bodies, either apically docked or mislocalized, 
remained largely intact (Extended Data Fig. 5a, b). Although axoneme 
structure was unaffected in TKO MCCs, basal body orientation and 
ciliary axoneme directionality exhibited mild defects (Extended Data 
Fig. 5c, d), which, in combination with the ciliation defects, probably 
evoked a strong mucociliary clearance phenotype. 


miR-34/449 functions in Xenopus MCCs 


Mammalian and Xenopus miR-34/449 miRNAs are not only conserved 
in sequence’*”’, but also in MCC-specific expression and ciliogenesis”®. 
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b, Lateral projections of confocal micrographs shown in a. c, Transmission 
electron microscopy (TEM) confirms basal body docking defects in TKO 
MCCs (n = 3). d, Quantification of basal body docking based on TEM studies 
in c. Docked and undocked basal bodies exhibit a distance = 0.3 tm and 

> 0.3 sm to the apical membrane, respectively. Error bar, s.e.m. 


Asa mucociliary epithelium, the Xenopus embryonic epidermis resem- 
bles mammalian airway epithelia in MCC development and function”. 
As in mouse, knockdown of Xenopus miR-34a/34b/449a by morpholino 
(MO) injection (miR-34/449 MOs) reduced cilia number and length in 
tadpole epidermal MCCs (Fig. 4a, b). No obvious defects in embryonic 
development, hydrocephalus, MCC cell fate specification or other cell 
type specification were observed (Extended Data Fig. 6a—f). In miR-34/ 
449 morphants, a significant portion of MCCs were either partially ciliated 
(> 50%), or devoid of cilia (29% + 17%), with frequent, unorganized 
subapical Ac-o.-tub enrichment indicating defective basal body docking”*® 
(Fig. 4b). Consistently, basal bodies detected by y-tub or Sas6-GFP”’ 
exhibited irregular distribution and frequently failed to form cilia in 
miR-34/449-deficient MCCs (Fig. 4c and Extended Data Fig. 6g). 


miR-34/449 miRNAs directly repress cp110 in MCCs 


The miR-34/449 increase during MCC differentiation” predicts a de- 
crease of functionally important targets, which invariably contain miR- 
34/449 binding site(s) in the 3’ untranslated region (3’ UTR)*!”*. We 
analysed published gene expression profiles of tracheal MCCs during 
differentiation’, then selected 57 potential miR-34/449 targets for quan- 
titative polymerase chain reaction with reverse transcription (qRT-PCR) 
validation in DKO and TKO tracheas, and finally narrowed down to 
those with important ciliogenesis functions (Extended Data Table 1). 
Cp110 emerged as a strong candidate, containing two miR-34/449 bind- 
ing sites (Extended Data Fig. 7a) and exhibiting miR-34/449-dependent 
repression in vivo and in luciferase assays (Fig. 5a, b and Extended Data 
Fig. 7b, c). 

Cp110 isa distal centriolar protein suppressing primary cilia assembly'*”. 
Aberrant Cp110 retention in mother centrioles is correlated with im- 
paired basal body docking™. In addition to its well-characterized roles 
in primary cilia, Cp110 is also implicated as an important regulator of 
motile ciliogenesis**”*. 

mir-34/449 and cp110 are conserved in mice and frogs. Xenopus cp110 
contains one predicted miR-34/449 binding site (Extended Data Fig. 7d), 
suggesting a selective pressure to preserve miR-34/449-cp110 regula- 
tion. miR-34/449 and cp110 levels are inversely correlated during MCC 
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Figure 4 | miR-34/449 deficiency causes defective ciliogenesis in the 
Xenopus embryonic epidermis. a, MCCs in miR-34/449 morphants show 
reduced cilia length and number, demonstrated by immunofluorescence for 
Ac-o.-tub (cilia) and phalloidin-488 (actin). b, Quantification of MCC ciliation 
ina. P< 0.001 froma x°-test. c, Co-staining of Ac-o.-tub (cilia) and y-tub (basal 


differentiation in Xenopus; and cp110 mRNA was derepressed in miR- 
34/449 morphants (Extended Data Fig. 7e, f, g). Strikingly, cp1 10 knock- 
down in miR-34/449-deficient MCCs significantly rescued ciliation defects 
(Fig. 5c and Extended Data Fig. 8a, b). 

We subsequently examined, in control and miR-34/449 morphants, 
the localization of Centrin4 (ref. 37), an important basal body compo- 
nent whose loss-of-function causes a PCD-like phenotype”’. Although 
strong Centrin4 foci were enriched apically in controls, these foci signi- 
ficantly decreased in intensity and failed to localize to the apical mem- 
brane of miR-34/449-deficient MCCs (Fig. 5d and Extended Data Fig. 8c). 
These findings demonstrated defective basal body docking and decreased 
basal body incorporation of Centrin4 in miR-34/449-deficient MCCs, 
both of which were rescued by co-injection of cp110 MO (Fig. 5d). 

Consistently, cp110 overexpression generally phenocopied miR-34/ 
449 knockdown, causing impaired ciliation and decreased Centrin4 incor- 
poration into basal bodies, without affecting MCC cell fate specification 
or apical actin organization (Fig. 5c and Extended Data Fig. 8b, d, e and 9a). 
cp110 with a 3' UTR deletion exhibited a stronger phenotype than full- 
length cp110, suggesting cp110 repression by miR-34/449 even during 
overexpression (Extended Data Fig. 7d and 8b, d). Surprisingly, cp110 
MO injection alone also gave rise to reduced cilia number and length, 
and aberrant basal body aggregation in MCCs (Fig. 5c and Extended 
Data Fig. 8a, b and 9b). Thus, miR-34/449 miRNAs mediate precise 
Cp110 regulation during vertebrate MCC ciliogenesis (Fig. 5e). 


Discussion 


miR-34/449 represent the first non-coding RNAs whose deficiency 
causes a PCD-like airway and fertility phenotype. miR-34 miRNAs have 
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bodies) in miR-34/449 morphants reveals uneven/aggregated distribution of 
basal bodies, which frequently fail to form cilia. Total numbers of embryos/total 
number of cells analysed were uninjected (4/14; a total of 14 cells from a total 
of 4 injected embryos) and miR-34/449 MOs (5/30). Embryos were derived 
from at least two females and independent fertilizations per experiment. 


been mostly characterized as p53 targets that elicit multiple tumour sup- 
pressor effects***’; yet the MCC-specific miR-34/449 expression and 
functions are likely p53-independent (data not shown). Although most 
characterized PCD mutations affect structural components of the ciliary 
axoneme’’® or basal body structure*’, miR-34/449 miRNAs regulate 
ciliogenesis by promoting basal body maturation and docking without 
affecting overall basal body structure. Interestingly, redundant miR-34/ 
449 miRNAs are not functionally equivalent in mice. One mir-34b/34c 
or mir-449 allele is sufficient for proper MCC ciliation; two intact mir- 
34a alleles in mir-34b/34c /~; mir-449-/— DKO mice still yield clear 
respiratory and fertility phenotypes (data not shown). Distinct roles of 
miR-34/449 miRNAs could reflect differential expression rather than 
target specificities. 

Ina previous study, miR-449 inhibition alone caused defective cilio- 
genesis by derepressing Notch1 and DIl1 (ref. 10). Yet normal MCC 
specification in miR-34/449-deficient MCCs suggest that Notch path- 
way components, with well-characterized roles in regulating MCC 
specification”’, may not act as key miR-34/449 targets in motile cilio- 
genesis. Here we provide molecular and functional evidence demon- 
strating cp110 as a major miR-34/449 target. Cp110 levels have to be 
tightly regulated spatially and temporally during ciliogenesis; and proper 
Cp110 removal from mother centrioles is essential for ciliation’’. Previ- 
ous studies mostly focused on ubiquitin-mediated proteasomal degradation 
of Cp110 (refs 44, 45). Our study reveals the important post-transcriptional 
regulation of Cp110 by miR-34/449. Although MCC-enriched miR- 
34/449 miRNAs repress cp 110 to facilitate ciliation, other miRNAs (for 
example, miR-129) repress cp110 expression in other cell types to reg- 
ulate ciliogenesis of primary or motile mono-cilia*®. Thus, ciliogenesis 
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c, Co-injection of miR-34/449 and cp110 MOs rescues MCC ciliation, whereas 
cp110A3'UTR overexpression phenocopies miR-34/449 morphants. cp110 MO 
alone also induced ciliation defects. Ac-a-tub, cilia; phalloidin-488, actin. 


is controlled by downregulation of cp110 through distinct miRNAs in 
distinct cell types. 

For miRNAs, their small size and imperfect target recognition facil- 
itate regulation of multiple mRNA targets. Here, cp110 knockdown 
restored basal body maturation/docking and ciliogenesis in miR-34/ 
449-deficient MCCs; however, additional miR-34/449 targets prob- 
ably exist to regulate the organization of the apical actin cytoskeleton 
in MCCs. 

Mucociliary epithelia are morphologically and functionally con- 
served among vertebrates”’. Our findings demonstrated a conserved 
mechanism that regulates basal body maturation and docking in MCCs 
by miR-34/449-dependent fine-tuning of Cp110 levels. This mecha- 
nism could have profound implications for the underlying mechan- 
isms disrupted in patients with PCD-like syndromes. 


METHODS SUMMARY 


Detailed information on the following methods can be found in the full Methods: 
mouse breeding and monitoring, histological analyses, RT-PCR, in situ hybridization 
(ISH), visualization of ciliary beating and mucociliary transport, immunoflures- 
cence staining, scanning and transmission electron microscopy (SEM and TEM), 
western blotting and manipulation of Xenopus embryos. 


Online Content Any additional Methods, Extended Data display items and Source 


Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Mouse breeding, genotyping and monitoring. mir-34a~/~; mir-34b/34c*’; 
mir-449 /~ intercross mating and mir-34a ee “3 mir-34b/34c / oe mir-449*’~ inter- 
cross mating were both established to generate mir-34/449 triple knockout (TKO) 
mice. TKO mice were generated on a mixed genetic background containing C57BL/6, 
129 and CD1, and were subsequently backcrossed to C57BL/6 for at least four gen- 
erations. All mice were housed in a non-barrier animal facility at UC-Berkeley. The 
following primers were used for genotyping, with parenthetical values indicating 
the size of the diagnostic PCR product: mir-34a-Common-R, ACTGCTGTACCC 
TGCTGCTT, with mir-34a-WT-F, GFACCCCGACATGCAAACTT (wild-type band, 
400 bp), or mir-34a-KO-F, GCAGGACCACTGGATCATTT (KO band, 263 bp); 
mir-34b/34c-Common-R, GAGATTTTCGTGGCGCTTTA, with mir-34b/34c-WT-EF, 
GCCTCCTGTGAATCGTCATT (wild-type band, 264 bp), or mir-34b/34c-KO-F, 
GCGGCCGCATAACTTCGTAT (KO band, 155 bp); . mir-449-Common-R, AC 
ATCCCCAAGATATCCCA, with mir-449-WT-F, GTATCCACGCCACCACA 
(wild-type band, 724 bp), or mir-449-KO-F, GAGTTTTCTGGGCTTGCC, (KO 
band, 406 bp). Litters were monitored daily for the first 60 days for survival, and 
their body weight was measured every other day for the first 30 days. The sound wave 
analyses were performed using audios that recorded the TKO phenotypes (Sup- 
plementary Video 1), and respiratory sound was analysed using Audacity. 
Histological analyses. Tissues were dissected and fixed overnight in 10% neutral 
buffered formalin, pH 7.4 (NBF) (Fisher Scientific, #SF100-4), processed by stan- 
dard procedures, and embedded in paraffin blocks. All the blocks were sectioned 
at 10 jum, and slides were stained by hematoxylin and eosin (H & E). In this analysis, 
lungs were inflated via trachea with 10% NBF before fixation; and sinuses were 
processed by post-fixation decalcification for 5 to 10 days in 10% EDTA, pH 7.0. 
qRT-PCR. Total RNA was isolated by TRIzol (Invitrogen, #15596) from tracheal 
epithelium per the manufacturer’s protocol, and treated with DNase I (Invitrogen, 
#18068) to remove DNA contamination. For quantitation of mRNAs, TRIzol pre- 
pared RNA was reversely transcribed into cDNA using SuperScript III reverse tran- 
scriptase (Invitrogen, #18080) with random primers. SYBR Green-based qPCR was 
subsequently performed on a 7900HT real-time PCR system (Applied Biosystems) 
using cDNA as template. The Gapdh-encoding transcript was used as an endog- 
enous control in each qPCR. The following qPCR primers were used in this study, 
with parenthetical values indicating the size of the diagnostic PCR product: Cp110- 
F: TCTCCACTGCTTACCATTGA, and Cp110-R: GTAAATGGTTTCTGTTGC 
CC (195 bp); Foxjl-F: CTCCTATGCCACTCTCATCT, and Foxj1-R: GGATGGA 
ATTCTGCCAGGTG (137 bp); Gapdh-F: AACTTTGGCATTGTGGAAGG, and 
Gapdh-R: CACATTGGGGGTAGGAACAC (222 bp). Four independent groups 
of mice were collected for RT-PCR quantitation of Cp110 and Foxj1. Each group 
contained a TKO mouse, an age-matched wild-type mouse, and a littermate-controlled 
DKO mouse (mir-34a“"; mir-34b/34c /~ or mir-34a/ 3 mir-449/ ~ ). For Xenopus 
laevis (RT-PCR, cDNA was generated from total RNA extracts of skin explants using 
iScript reverse transcription supermix (BioRad, # 170-8840); and the following qPCR 
primers were used: Foxj1-F: CCAGTGATAGCAAAAGAGGT, and Foxj1-R: GC 
CATGTTCTCCTAATGGAT; Cp110-F: AGCCAGAATCCAAGTAAAGG, and 
Cp110-R: CTTGCTTCTTTTCAGCAGTC; EF la-F: CCCTGCTGGAAGCTCTT 
GAC, and EFla-R: GGACACCAGTCTCCACACGA; ODC-F: GGGCTGGATC 
GTATCGTAGA, and ODC-R: TGCCAGTGTGGTCTTGACAT. Reactions were 
performed on a BioRad CFX96 Real-Time System C1000 Touch. 

For miRNA quantitation, TRIzol prepared total RNA was poly (A)-tailed by Poly 
(A) Polymerase (Epicentre, #PAP5104H). Poly (A)-tailed small RNA was reversely 
transcribed into small RNA cDNA with SuperScript III reverse transcriptase (Invi- 
trogen, #18080) using miRNA RT primer (5'-CGAATTCTAGAGCTCGAGGCA 
GGCGACATGGCTGGCTAGTTAAGCTTGGTACCGAGCTCGGATCCACTA 
GTCCTTTTTTTITTTTTTTTTITTTTTTTVN-3’). V is A, G or C; N is A, G, C 
or T. TaqMan-based qPCR was subsequently performed on a 7900HT fast real- 
time PCR system (Applied Biosystems). The U6 snRNA was used as the endog- 
enous control for miRNA real-time qPCR analyses. Universal TaqMan probe, CT 
CGGATCCACTAGTC; Universal reverse primer, CGAATTCTAGAGCTCGAG 
GCAG. The following forward primers, specific for each small RNA, were used in 
our studies: miR-34a, TGGCAGTGTCTTAGCTGGTTGT; miR-34b, AGGCAG 
TGTAATTAGCTGATTGT; miR-34c, AGGCAGTGTAGTTAGCTGATTGC; miR- 
449a, TGGCAGTGTATTGTTAGCTGGT; miR-449b, AGGCAGTGTTGTTAG 
CTGGC; miR-449c, AGGCAGTGCATTGCTAGCTGG; and U6 snRNA, CGCA 
AATTCGTGAAGCGTTCC. For Xenopus miRNA quantitation the following pri- 
mers were used: U6 snRNA: ATGTGAAGCGTTCCATATGA; miR-34a: TGGCA. 
GTGTCTTAGCTGGTTGTT; miR-34b: CAGGCAGTGTAGTTAGCTGATTG; 
miR449c: T@CACTTGCTAGCTGGCTGT. Statistical evaluation was performed 
using a paired t-test. 

In situ hybridization (ISH). Standard histology protocols were used to prepare 
P25 lung and trachea section for miRNA ISH using diethylpyrocarbonate (DEPC) 
treated water for all procedures. After deparaffinization and rehydration, slides were 
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fixed with 4% paraformaldehyde (PFA), treated with proteinase K, and fixed again 
with 4% PFA. Slides were incubated first with pre-hybridization solution (3 to 4h 
at 60 °C), and then with hybridization solution mixed with digoxigenin (DIG)-labelled 
LNA probes against each miR-34/449 miRNA (16 h at 60 °C). Post hybridization, 
slides were washed for 10 min at 60 °C ina graded series of SSC solutions (2X, 1.5X, 
0.2X), then incubated with alkaline phosphatase (AP)-conjugated anti-DIG anti- 
body in blocking solution. After washing with PBS and alkaline phosphatase (AP) 
buffer, the slides were incubated with NBT/BCIP in AP buffer to visualize blue ISH 
signals. Nuclear fast red (Sigma, #N3020) was used for nuclear counter-staining. Slides 
then were dehydrated and mounted with Permount (Fisher Scientific, #SP15-100). 
Solutions for ISH (BioChain #K2191020) as described above. DIG-labelled miR- 
34a (ACAACCAGCTAAGACACTGCCA), miR-34c (GCAATCAGCTAACTAC 
ACTGCCT), and miR-449c (CCAGCTAGCAATGCACTGCCT) probes were pur- 
chased from Exiqon (#38487-01, #38542-01, and #39641-01 respectively). For Xenopus 
laevis ISH, embryos were fixed in MEMFA at the indicated stages, and standard 
protocols were used for ISH and bleaching of embryos”’. Foxj1 antisense probe** 
was synthesized using SP6 polymerase (Promega, #P 1085). Whole mount ISH was 
performed on groups of 25 control and manipulated specimens per time point and 
batch, which were derived from two different mothers. 

Visualization of ciliary beating and mucociliary transport. Trachea from adult 
mouse was cut into 2 mm X 2 mm pieces under dissection microscope. Trachea pieces 
then were transferred into the chamber on a glass slide, which was made by placing 
a 0.5-mm sticky spacer (Bio-Rad, #SLF-1201) on the slide surface. The chamber 
was filled with 100 pl M199 Hank’s balanced salts medium (Invitrogen, #12350- 
039) mixed with 1 il red fluorescent 0.5-j1m microspheres (Invitrogen, # F-8812); 
and a coverglass was placed on the sticky spacer to seal the chamber. Live images of 
the tracheal epithelium were recorded with a high-speed GX-1 Memrecam camera 
(NAC Image Technology) attached to an Olympus [X71 microscope. DIC channel 
was used to record multiciliary beating, and the red fluorescent channel was used to 
record mucociliary transport. Videos were recorded at 250 frames per second (FPS) 
for 8 s, and are played at 250 FPS in Supplementary Videos 2 and 3. Image] was used 
to process and analyse raw images (Supplementary Videos 2 and 3). 
Immunofluorescence staining and confocal imaging. For immunofluorescence 
staining on cryosections, whole tracheas of adult mice were fixed overnight in ice- 
cold acetone, and then processed through a graded series of sucrose solutions (from 
5% to 20%, Fisher Scientific, #S5-500). Tracheas were embedded in (1:1) 20% sucrose 
and O.C.T. compound (Tissue-Tek, #4583) and sectioned with MICROM HM 550 
(Fisher Scientific) at —21 °C at a thickness of 6 um. Slides were washed in PBS (3X, 
15 min), blocked (1h at room temperature) in PBSTB (0.1% Triton X-100, 1% 
bovine serum albumin in PBS), and incubated (overnight at 4°C) with primary 
antibodies (1:400, anti-Foxj1, Sigma HPA005714; 1:1000 anti-acetylated-«-tubulin, 
Sigma T6793). Sides then were washed three times in PBST (0.1% Triton X-100 in 
PBS) and incubated (1h at room temperature) with secondary antibody (1:1000, 
Cy3-goat-anti-mouse, Molecular Probes A10521 or 1:500 Alexa Fluor 488 goat anti- 
rabbit, Molecular Probes A11034). Slides were mounted with VECTASHIELD mount- 
ing medium with DAPI (Vector Laboratories, #H-1200). Images were taken with a 
Zeiss LSM 710 AxioObserver inverted 34-channel confocal microscope and ana- 
lysed with Zeiss Zen software. 

For whole trachea staining, tracheas were cut longitudinally into two pieces, 
which were fixed either in 80% methanol (EMD, #MX0485P-4) with 20% DMSO 
(Fisher Scientific, #BP231-100) (overnight at —20 °C), or in 4% paraformaldehyde 
(PFA) (overnight at 4 °C), respectively. Fixed tissues were washed and blocked as 
described above, and then incubated (overnight at 4 °C) with primary antibodies 
(1:500 anti-y-tubulin, Sigma T5192 for the methanol fixed tissue; 1:400 anti-Foxj-1, 
Sigma HPA005714 and 1:1000 anti-acetylated-c.-tubulin for the PFA fixed tissue). 
Tissue then was washed, incubated with secondary antibody, counterstained with 
DAPI, and imaged as described above. 

For Xenopus laevis MCC staining, immunofluorescence was performed on whole- 
mount embryos and skin explants fixed at embryonic stages 30-33 (unless specified 
otherwise) in 4% paraformaldehyde for 1-2 h at room temperature” or in Dent’s 
fixative (80% methanol, 20% dimethylsulfoxide) for 48 h at —20 °C. Embryos were 
processed according to standard procedures**. Morphological analysis of cell types 
was performed as previously described**. Primary antibodies were as follows: mouse 
monoclonal anti-acetylated-o.-tubulin (1:700; Sigma T6793); rabbit polyclonal anti- 
y-tubulin (1:500; Sigma T5192). Secondary antibodies (1:250) were as follows: 
AlexaFluor 488-labelled goat anti-mouse antibody (Molecular Probes A11001), 
AlexaFluor 555-labelled goat anti-mouse antibody (Molecular Probes A21422), 
AlexaFluor 555-labelled goat anti-rabbit antibody (Molecular Probes A21428) and 
AlexaFluor 405-labelled goat anti-mouse antibody (Molecular Probes A31553). 
Actin staining was performed by incubation (30-60 min at room temperature) 
with AlexaFluor 488-labelled Phalloidin (1:40; Molecular Probes A12379). Z-stack 
analysis and processing were performed using Image] and Zeiss ZEN software. All 
confocal imaging was performed using a Zeiss LSM700. 
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Scanning electron microscopy (SEM). Adult trachea tissue was fixed using 
Karnovsky’s fixative in 0.1 M sodium phosphate buffer (Sorenson’s), washed with 
Sorenson’s sodium phosphate buffer and post-fixed using 1% OsO, in Sorenson’s 
for 1h. Tissue was dehydrated by passaging through a graded series of ethanol 
solutions, then critical point dried using a Tousimis 931 super critical point dryer. 
The tissue was mounted on aluminium stubs and sputter coated with gold using a 
PELCO SC-7 coater. The samples were viewed on an FEI XL30 TMP SEM and dig- 
ital images were collected. SEM was performed in the electron microscopy facility 
of the University of California at Davis. 

Transmission electron microscopy (TEM). Adult trachea tissue was fixed, washed, 
and post-fixed as for SEM. After rinsing in double-distilled water (DDW), the tissue 
was incubated (30 min at room temperature) in 0.1% tannic acid, rinsed again in 
DDW, and incubated (1 h) in 1% uranyl acetate in DDW. Tissue was dehydrated by 
passaging through a graded series of acetone solutions, then infiltrated and embedded 
in an epoxy resin mixture. Survey thick sections were cut, and ultra-thin sections 
of the selected areas were generated with a diamond knife (Diatome). The thin sec- 
tions were picked up on copper grids and stained with uranyl] acetate and lead citrate 
before viewing on a Philips CM120 Biotwin. Micrographs were taken with a Gatan 
MegaScan Model 794/20 digital camera. TEM analyses with longitudinal and trans- 
verse sections were performed in the electron microscopy facility of the University 
of California at Davis. Image] was used to measure the distance between basal bodies 
and apical surface in TEM pictures, and Oriana was used to analyse the directionality 
of ciliary axonemes. The following criteria were used to determine multiciliated cells 
(MCCs) and their apical surface in longitudinal TEM. Cells containing basal bodies 
in tracheal epithelium were defined as MCCs. For ciliated MCCs, the surface with 
intact view from ciliary axoneme to basal body was determined as the apical sur- 
face; and for non-ciliated MCCs, the surface with microvilli was determined as the 
apical surface. 

Air-liquid interface (ALI) culture of primary tracheal epithelia. Primary tracheal 
epithelial cells were cultured as described previously”. In short, tracheas from three 
adult mir-34/449 mutant mice of the same genotype (~ P60) were dissected and cut 
longitudinally in ice-cold Ham’s F-12 medium (Life Technologies, #11765-054) 
with penicillin/streptomycin (Life Technologies, # 15140-163). To isolate epithelial 
cells, tracheas were incubated in 1.5 mg ml! pronase (Roche Diagnostics, #101659 
21001) in Ham’s F-12 media overnight at 4 °C. We stopped the tracheal pronase diges- 
tion by adding fetal bovine serum (FBS, Omega Scientific, #FB-01) to a final con- 
centration of 10%. The tracheas were washed twice in ice-cold fresh Ham’s F-12 
media with 10% FBS. Tracheal epithelial cells were then pelleted by pooling the 
pronase digestion and washes for centrifugation at 400g for 10 min at 4°C. The 
pelleted cells were treated by DNase I (Sigma, #DN25), resuspended in 1 ml FBS, 
then plated in 9 ml pre-warmed basic medium (BM), which contained 1:1 DMEM: 
Ham’s F-12 (Life Technologies, #11330-032), penicillin/streptomycin, 1.5 mM glu- 
tamine (Life Technologies, #25030-149), 0.03% sodium bicarbonate (Life Technol- 
ogies, #25080-094), and 0.1% fungizone (Life Technologies, #15290-018)). Fibroblasts 
were depleted by a 4h incubation at 37 °C, after which the tracheal epithelial cells 
were plated on trans-wells (Corning, #3470) in 24-well plates in proliferation 
medium, which contained BM with 5% FBS, 25 ng ml~ 7 epidermal growth factor 
(EGF) (BD Biosciences, #354001), 10 pg ml ? insulin (Sigma, #11882), 5 ug ml! 
apo-transferrin (Sigma, #T1147), 0.1 ug ml 1 cholera toxin extract (Sigma, #C8052), 
30 pg ml~ | bovine pituitary extract (BPE) (Hammond Cell Tech, #1078-NZ), and 
50 nM retinoic acid (RA, Sigma, #R2625-500MG). The proliferation medium was 
changed every other day for 7 days, until we switched the culture to air-liquid inter- 
face culture by removing the media in the apical chamber, and replacing the media 
in the basal chamber with differentiation medium (BM with 2% Nu-Serum (BD, 
#355100) and 50 nM RA). The differentiation medium was changed every other day 
for 23 days, when cells were ready for immunofluorescence staining. 

Luciferase assay. A fragment of the Cp110 mRNA 3’ UTR containing two pre- 
dicted miR-34/449 sites was cloned into the Fsel site immediately downstream of 
the stop codon in the pGL3-Control firefly luciferase vector (Promega, #E1741). We 
amplified a fragment of Cp110 3’ UTR using PCR with Cp110-3’ UTR-F, AAGGC 
CGGCCGAAGACAGCACTCACTGGGA, and Cp110-3'UTR-R, GTGGCCGG 
CCTTCTCTGAGATCCGGATTGC. NIH/3T3 cells were cultured in 10% bovine 
serum in DMEM (Invitrogen, # 11995-073) ina 12-well plate at a density of 1 X 10° 
cells per well. We co-transfected each well of NIH/3T3 cells with 10 ng of pGL3 
constructs, 100 ng of pRL-TK Renilla vector (Promega, #E2241), and 15 nM miR- 
34b miRNA mimics (Integrated DNA Technologies, 5’-AGGCAGUGUAAUU 
AGCUGAUUGU-3’ and 5’-AAUCACUAACUCCACUGUUAUC-3’) or siGFP 
(Integrated DNA Technologies) using TransIT-TKO Transfection Reagent (Mirus 


Bio, # MIR 2150). At 24h after transfection, firefly and Renilla luciferase activities 
were measured using the Dual-Luciferase reporter assay system (Promega, #E1910). 
The luciferase activity was normalized as the ratio of firefly/Renilla luciferase activities. 
Western blotting. Protein was collected from mouse tracheal epithelium by incu- 
bating resected and cleaned trachea in radioimmunoprecipitation assay (RIPA) 
buffer with complete mini protease inhibitor cocktail tablet (Roche, #11836153001) 
for 30 min on ice. Western blot followed the standard protocols. mouse anti-B-actin 
(Sigma, #45441, ~46 kDa) was used as loading control at 1:40,000 dilution. Rabbit 
anti-Cp110 (Thermo Scientific, # PA5-34380, ~120 kDa) was used at 1:1,000 dilu- 
tion in 5% non-fat dry milk in TBS-T (20mM Tris pH 7.6, 150 mM NaCl and 
0.1% Tween-20). Horseradish peroxidase (HRP) conjugated secondary antibodies 
(Santa Cruz Biotechnology, #sc-2004 and #sc-2005) were used at 1:5,000 dilution. 
Blots were analysed using ImageJ. Band intensities were normalized against cor- 
responding actin and compared to wild-type controls for ratio calculation. 
Manipulation of Xenopus laevis embryos and skin explants. X. laevis eggs were 
collected and in vitro fertilized, then cultured and microinjected by standard 
procedures*®. Embryos were injected with morpholino nucleotides (MOs, Gene 
Tools) at the two- to four-cell stage using a PicoSpritzer setup in 1/3 modified 
frog Ringer’s solution (MR) with 2.5% Ficoll PM 400 (GE Healthcare 17-0300-50), 
and then were transferred into 1/3X MR containing gentamycin” after injection. 
Drop size was calibrated to about 7-8 nl per injection. Rhodamine-B dextran (0.5- 
1.0 mgm’; Invitrogen D1841) was co-injected and used as lineage tracer. MO 
(Gene Tools) doses were administered as follows: a total dose of 30 ng miR-34/449 
MOs; 10 ng each for miR-34a MO (5’-CAACAACCAGCTAAGACACTGCCAA- 
3’), miR-34b MO (5'-ACAATCAGCTAACTACACTGCCTGA-3’), and miR-449a 
MO (5'-AACCAGCTAACATTACACTGCCTT-3’), or 30 ng of a miR-control MO 
(5'-TGCACGTTTCAATACAGACCGT-3’), or 17 ng of cp110 MO (5'-ACTCTT 
CATATGGCTCCATGGTCCC-3’). AnmRNA encoding Centrin-4—RFP or Centrin- 
GFP” and Sas6-GFP” was prepared using the Ambion message machine kit using 
SP6 (AM1340) and diluted to 50-100 ng pl for injection into the embryos (0.8- 
1.6 ng total per embryo). Xenopus tropicalis cp110 cDNA (PureYield Midiprep; 
Promega, #A2495) was derived from a clone matching BC167469 obtained from 
Thermo Scientific (#MXT1765-202715711). Cp110A3'UTR was generated from 
the same clone, which was digested with Sph1 (New England Biolabs, #R0182S) 
and re-ligated to remove most of the 3’ UTR. Analysis of morphant tadpole brains 
for signs of hydrocephalus was performed as previously described”. 

Xenopus skin explants were generated from animal caps**, dissected in 1X 

modified Barth’s saline from stage 9 embryos, which were either un-injected (time 
course experiment) or injected with Ctrl MO or miR-34/449 MOs (for quantifi- 
cation of cp110 and foxj1 expression). Explants were cultured in 0.5 modified 
Barth’s saline until un-manipulated control embryos reached appropriate stage. In 
the time course experiment, stage 10, 26 and 32 explants represented ciliation state 
of MCCs for quantitation of cp110, foxj1 and miR-34/449 miRNAs. In addition, 
cp110and foxj1 expression levels in Ctrl MO and miR-34/449 MOs injected embryos 
(injected 4X into the animal hemisphere at the 4-cell stage) were assessed at onset 
of ciliation (stage 26 explants) to examine the effect of miR-34/449 on foxjl and 
cp110 levels. Statistical evaluation of experimental data was performed using chi- 
squared tests (http://www.physics.csbsju.edu/stats/contingency.html) or Wilcoxon 
sum of ranks (Mann-Whitney) tests (http://www.fon.hum.uva.nl/Service/Statistics/ 
Wilcoxon_Test.html). 
Sample size and analysis. Sample sizes for all experiments were chosen based on 
previous experiences. No randomization or blinding was applied for all our studies. 
Ethics statement on animal experiments. This work was done with approval of 
University of California, Berkeley’s Animal Care and Use Committee. University 
of California, Berkeley's assurance number is A3084-01, and is on file at the National 
Institutes of Health Office of Laboratory Animal Welfare. 
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Extended Data Figure 1 | The generation and phenotypic characterization 
of mir-34/449 TKO mice. a, miR-34/449 miRNAs are evolutionarily 
conserved with extensive sequence homology across many species. miR-34a 
has a more ancient evolutionary history compared to the rest of miR-34/449 
miRNAs. miR-34a is conserved in Deuterostome, Ecdysozoa and 
Lophotrochozoa, yet the rest of miR-34/449 miRNAs have only vertebrate 
homologues. b, Diagrams of the targeted deletion strategy to generate mir-449 
knockout mice. Since all mir-449 miRNAs are within intron 2 of their host gene, 
cdc20b, we deleted mir-449 with a minimally predicted impact on cdc20b. 

c, miR-449 expression is absent in mir-449 ‘~ knockout animals, as 
demonstrated by real-time PCR analyses in lung tissues from littermate- 
controlled wild-type and mir-449 ‘~ mice at postnatal day 35 (n = 3). 
miR-449a real-time PCR primers exhibit a modest cross-reaction with miR-34 


miRNAs. d, mir-34/449 TKO mice have a significant postnatal attenuation in 
body weight. Littermate-controlled mir-34a ‘~; mir-34b/34c/~, mir-34a~; 
mir-34b/34c_/~; mir-449*/— and TKO mice were monitored for their body 
weight every other day for 30 days after birth. Paired t-test, ***P < 0.001. 

e, Surviving mir-34/449 TKO mice exhibit coughing/sneezing-like phenotype. 
The respiratory noise of littermate-controlled mir-34a ‘—; mir-449-/— DKO 
and TKO mice was shown by sound wave analysis at postnatal day 30 (n = 14). 
f, Pulmonary inflammation occurs in a subset of mir-34/449 TKO mice. A 
representative H&E analysis of lung tissues from an adult TKO mouse indicates 
an increased infiltration by inflammatory cells. A total of 3 out of 15 TKO mice 
examined exhibit lung infection. g, mir-34a /~; mir-34b/34c_/" DKO mice 
resemble wild-type mice, exhibiting no obvious respiratory defects in paranasal 
sinus or lung (n = 3). All error bars represent s.e.m. 
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Extended Data Figure 2 | Phenotypic characterization of reproductive 
organs and brain in mir-34/449 TKO mice. a, Adult male and female 
mir-34/449 TKO mice are infertile. Male (left) and female (right) reproductive 
organs from littermate-controlled DKO and TKO mice were subjected to H&E 
staining (n = 3). Boxes indicate areas depicted in Fig, le. b, Adult mir-34a/"; 
mir-449 ‘~ DKO male mice exhibit no defects in spermatogenesis. c, Adult 
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mir-34a_‘—; mir-34b/34c_/— DKO female mice display no defects in 
reproductive organs. d, The adult mir-34/449 TKO brains do not exhibit 


hydrocephalus, yet they are smaller in size than wild-type and DKO controls. 
The a/b markings on the images indicate the coronal to horizontal ratios. n = 3 


for b, c and d. 
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Extended Data Figure 3 | miR-34/449 miRNAs are enriched in airway 
MCCs. a, Most miR-34/449 miRNAs are enriched in tissues with motile cilia. 
Using real-time PCR, the expression of miR-34a, miR-34c and miR-449c was 
measured in multiple tissues from newborn, P10, P20 and adult wild-type mice. 
Both miR-34c and miR-449c are exclusively expressed in tissues with motile 
cilia, whereas miR-34a exhibits a broader expression pattern (n = 3). b, The 
real-time PCR assay for each miR-34/449 miRNA specifically detects the 
corresponding miRNA. The specificity of each miRNA real-time PCR assay 
was validated using testis RNA from wild-type (WT), mir-34a /~, mir-34b/ 
34c/~, mir-449-’~, and TKO mice at postnatal day 35. The miR-449a assay 
shows a slight cross reaction with homologous miRNAs (n = 3). ¢, In situ 
hybridization of each miR-34/449 miRNA exhibits specific detection. No 


ie} 
x< 
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mir-34a"; mir-449" 


measurable miR-34/449 in situ signal is detected in TKO lung sections at 
postnatal day 25 (n = 2). d, miR-34/449 miRNAs are enriched in tracheal 
MCCs. In situ hybridization analyses demonstrate that miR-34c and miR-449c 
are specifically expressed in the tracheal MCCs, whereas miR-34a is expressed 
in both tracheal MCCs and the surrounding cell types (n = 2). e, mir-34/449 
TKO mice do not exhibit significant alterations in Foxj1 expression. 
Quantification of Foxj1 positive cells (left, n = 3) and Foxj1 mRNA (right, 

n = 4) was performed for well-controlled wild-type, DKO and TKO tracheas, 
using immunofluorescence and real-time PCR, respectively. Paired t-test; 

ns, P> 0.05. f, mir-34a ‘~; mir-449-/~ DKO tracheal epithelia are 
morphologically indistinguishable from wild-type controls in scanning 
electron microscopy (SEM) analyses (n = 3). All error bars represent s.e.m. 
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Extended Data Figure 4 | miR-34/449 deficiency causes defective ciliation 
and basal body docking in mouse airway MCCs. a, mir-34/449 TKO trachea 
exhibit reduced MCC ciliation. Quantification of fully ciliated MCCs (y-tub 
and Ac-o.-tub double-positive) and partially/non-ciliated MCCs, Ac-o-tub 
weak/negative) was performed in littermate controlled DKO and TKO mouse 
tracheas, using data from all three experiments in Fig. 3a. The number of 
cells with MCC identity (y-tub positive) is unaffected in TKO tracheas, yet 
one-third of the TKO MCCs display aberrant Ac-o-tub staining, indicating 
ciliation defects. Paired t-test; ns, P > 0.05, ***P < 0.001. b, The mir-34a ’; 
mir-34b/34c ‘~ DKO mice exhibit normal ciliogenesis in tracheal MCCs. 
Whole tracheas from age-matched adult wild-type and mir-34a_‘~; mir-34b/ 
34c ‘~ DKO mice were analysed by immunofluorescence staining for 
Ac-a-tub (cilia) and y-tubulin (basal bodies) (n = 3). ¢, mir-34/449 TKO 
primary tracheal epithelial cells exhibit ciliation defects in air liquid interface 
(ALD culture. ALI culture of MCCs were derived from tracheas of littermate- 
controlled mir-34a/~; mir-34b/34c*/; mir-449-/— and TKO mice, and 
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subjected to immunofluorescence staining for Ac-a-tub (cilia) and y-tub 
(basal bodies). In TKO and control ALI culture, comparable levels of y-tub 
positive cells are observed, however a large portion of TKO y-tub positive cells 
displayed a partial or complete loss of Ac-a-tub staining. The letters on the 
image indicate fully (a), partially (b) or non-ciliated (c) MCCs (n = 2). d, Basal 
bodies fail to dock to the apical membrane of mir-34/449 TKO MCCs in ALI 
culture. Lateral projections of confocal micrographs described in (c) show 
impaired apical localization of y-tub staining in TKO MCCs from ALI cultures, 
suggesting a defective basal body docking to the apical membrane. 

e, mir-34a /~; mir-449-/— DKO trachea exhibit no defects in basal body 
docking using transmission electron microscopy (TEM) analyses (n = 3). 

f, TKO tracheal MCCs exhibit a defective subapical actin network. Whole 
tracheas from adult wild-type, mir-34a /~; mir-34b/34c_/" DKO and TKO 
mice were analysed by immunofluorescence staining for Ac-a-tub (cilia) and 
phalloidin-488 (actin) (n = 2). All error bars represent s.e.m. 
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Extended Data Figure 5 | Major basal body structural components are 
intact in mir-34/449 TKO MCCs revealed by transmission electron 
microscopy. a, b, Apically docked (a) and undocked (b) basal bodies in 
mir-34/449 TKO MCCs have intact structural components. Basal body 
transition fibres (top), basal feet (middle) and striated rootlets (bottom) have 
comparable morphology among WT, DKO and TKO MCCs. For the top panel 
an arrow indicates a representative transverse view of transition fibres. For the 
middle panel an arrow indicates a representative transverse view of nine 
microtubule triplets with basal feet and arrowhead indicates a representative 
transverse view collected from a different height of a basal body, containing 
nine microtubule triplets without basal feet. For the bottom panel, an arrow 


indicates the longitudinal view of basal feet and arrowhead indicates the striated 
rootlet structure. c, Directionality of basal bodies (top) and axonemes (middle) 
is moderately affected in mir-34/449 TKO MCCs. In the top panel arrows 
point to the directions indicated by basal foot. In the middle panel red lines 
connecting the central pair of axonemes indicate the rotational polarity of each 
ciliary axoneme. In the bottom panel the angles of the axoneme directionality 
were statistically analysed as bidirectional circular data. The average angel 
was set from 0° to 180° axis. mir-34/449 TKO ciliary axonemes have moderately 
un-coordinated directionality compared to WT and DKO controls. d, mir-34/ 
449 TKO axonemes exhibit intact structures, including nine outer microtubule 
doublets, two central microtubule singlets and dynein arms (n = 3). 
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Extended Data Figure 6 | miR-34/449 deficiency in frog MCCs causes 
defective ciliogenesis without affecting cell fate specification. a, Injection of 
Ctrl or miR-34/449 MOs does not affect general embryonic development or 
neural tube closure. Xenopus laevis embryos were injected unilaterally with 
MOs at the 2—4-cell stage and analysed at neurula stages (18-20). Targeting of 
the skin ectoderm was confirmed by co injection of fluorescent rhodamine 
dextran. b, Frog miR-34/449 morphants do not exhibit hydrocephalus. 
Embryos were injected animally with control or miR-34/449 MOs into both 
dorsal blastomeres at the 4-cell stage to target the neural tube and brain regions. 
Subsequently, the whole brains were dissected and analysed at stage 45/46. 
The lack of hydrocephalus in miR-34/449 morphants argues against a role of 
miR-34/449 in ependymal ciliation. c, Quantification of fully ciliated, partially 
ciliated or non-ciliated MCCs reveals no significant change in total number 
of MCC-fated cells in miR-34/449 morphants. Error bars represent s.d. 
Wilcoxon two-sample test, ns, P > 0.05. d, e, foxj1 expression and specification 
of MCC fate is unaltered in miR-34/449 deficient embryos. d, Embryos were 
unilaterally injected with Ctrl or miR-34/449 MOs to the right side at the 
2-4-cell stage, cultured until stage 21 or 32 and processed for in situ 
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hybridization to monitor foxj1 expression in the mucociliary epithelium of 
the skin. No change in foxj1 expression can be detected. e, Real-time PCR 
analysis in Ctrl or miR-34/449 MOs injected skin explants at stage 26 (onset of 
ciliation) does not indicate reduced expression levels of foxj1. Error bars 
represent s.e.m. f, miR-34/449 deficient frog embryos exhibit normal 
development of mucociliary cell types. Detailed analysis of the embryonic skin 
at stage 30-32 reveals the presence (specification and intercalation) of all cell 
types in miR-34/449 morphants, including large goblet cells, small secretory 
cells (SSC), Ac-o-tub positive ciliated cells (MCC) and non-tubulin enriched 
ion secreting cells (ISC). g, miR-34/449 morphant MCCs exhibit an uneven 
distribution of basal bodies. Sas6-gfp mRNA was injected at the 2—4-cell stage to 
visualize basal bodies at stage 30-32. In control embryos Sas6-GFP foci are 
evenly distributed in fully ciliated MCCs, whereas miR-34/449 morphant 
MCCs are characterized by an uneven distribution and aggregation of basal 
bodies, which frequently fail to grow cilia (Ac-a-tub staining). Such phenotype 
is characteristic for basal body docking defects. Total numbers of embryos/cells 
analysed were Ctrl MO (4/7) and miR-34/449 MOs (6/10). 
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Extended Data Figure 7 | cp110 is a direct target of miR-34/449 miRNAs. 
a, A schematic representation of two predicted miR-34/449 binding sites in the 
mouse Cp110 3’ UTR and in the luciferase reporter construct that contains 
Cp110 3’ UTR. b, Cp110 protein levels at postnatal day 23 are elevated in 
mir-34/449 TKO tracheal epithelia. c, The expression of Luc-Cp110-3' UTR 
exhibits miR-34b-dependent repression in NIH/3T3 cells. Error bars represent 
s.e.m., n = 3. Paired t-test, * P< 0.05. d, A schematic representation of one 
predicted miR-34/449 binding site in the frog cp110 3’UTR. A truncated cp110 
construct, cp110A3’UTR, was made to generate a cp110 cDNA without the 
miR-34/449 target site. e, Real-time PCR monitoring cp110 reveals elevated 
mRNA expression levels of cp110 in miR-34/449 morphant frog skin explants 
as compared to Ctrl MO injected specimens. **P<0.01. Paired t-test. Error 
bars, s.e.m. f, Timeline of MCC ciliation and recapitulation of ciliation defects 


in skin explants (animal caps). Representative confocal images from staged 
whole embryos and skin explants injected with either Ctrl MO or miR-34/449 
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79" 25° a 
MOs show the onset of ciliation at stage 26 and fully ciliated skin ectoderm at 
stage 32 in whole embryos and Ctrl MO injected skin explants. miR-34/449 
MOs injected skin explants develop MCC ciliogenesis defects comparable to 
whole embryo treatment. Cilia: Ac-a-Tub (red), Actin: Phalloidin-488 (green) 
and nuclei: DAPI (blue). g, Expression of cp110, foxjl and miR-34/449 

RNAs during time course of ciliation in skin explants. Explants at stage 10 
represent unciliated MCC precursors, explants at stage 26 represent MCCs 

at the onset of ciliation, and stage 32 explants represent fully ciliated ectodermal 
epithelium. cp110 mRNA levels decrease over the time course of ciliation, 
with the strongest decrease between stage 10 and 26, while foxj1 mRNA levels 
rapidly increase during this time. miR-34a, -34b and -449c levels strongly 
increase between stage 10 and stage 26; and only a moderate increase or 

even decrease can be observed between stage 26 and 32, similar to foxj1 
expression levels. Error bars represent s.e.m. n = 2, technical replicates on 
pools of 30 skin explants for each time point. 
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Extended Data Figure 8 | miR-34/449 miRNAs promote ciliogenesis by 
repressing cp110. a, Representative examples of confocal images used for 
quantification of MCC ciliation in (b). Embryos were stained for Ac-a-tub 
(cilia) and phalloidin-488 (actin). White boxes indicate areas depicted in 
Fig. 5c. b, Quantification of MCC ciliation in a, d and Fig. 5c. °-test, ns 
P>0.05, ***P < 0.001. c, Centrin4d-GFP incorporation into basal bodies is 
affected in miR-34/449 deficient embryos. The centrin4-gfp mRNA was 
injected at the 2—4-cell stage to visualize basal bodies in MCCs at stage 32, and 
centrosomes in neighbouring epithelial cells. In Ctrl morphant embryos, 
Centrin4-GFP staining in basal bodies (smaller foci in ciliated cells) and 
centrosomes (bigger foci in non-ciliated cells, green arrowheads) are equally 


cp110A3'UTR-)- - - - s = + 
#ofembyos 16 4 8 8 12 13 12 


strong. In contrast, Centrin4—GFP staining in basal bodies is greatly reduced 
in miR-34/449 morphants, without alteration of fluorescent intensity in 
centrosomes of neighbouring cells. Total numbers of embryos/cells analysed 
were Ctrl MO (6/17), miR-34/449 MOs (7/23). d, Representative examples 
of confocal images from cp110 overexpression experiments used for 
quantification of MCC ciliation in b. White boxes indicate areas depicted in 
Fig. 5c. e, The number of MCC-fated cells in miR-34/449 or cp110 morphants, 
and embryos injected with cp110 DNA constructs is not reduced. 
Quantification of total MCC numbers (fully ciliated, partially ciliated or 
non-ciliated MCCs) is shown for frog embryos injected with various 
MOs/DNAs (corresponding to a, b, d and Fig. 5c). Error bars represent s.d. 


©2014 Macmillan Publishers Limited. All rights reserved 


Uninjected cp110 A3'UTR 


Ac-a-tub/ Centrin4-RFP/Actin 


Centrin4-RFP 


Extended Data Figure 9 | Gain and loss of cp110 affects MCC basal bodies, 
but not the apical actin meshwork. a, cp110 overexpression phenocopies miR- 
34/449 knockdown. Centrin4—RFP enrichment is strongly reduced in 
cp110A3'UTR overexpressing MCCs. It is noteworthy, that whereas ciliation 
and incorporation of Centrin4 are strongly affected in cp110A3'UTR injected 
embryos, formation of the apical actin meshwork appears largely unaffected. 
Together with the lack of cp110 knockdown to rescue the apical actin 
meshwork in miR-34/449 morphants, these data indicate an additional effect 
of miR-34/449 miRNAs on Actin formation/organization, which is 

cp110 independent. Cilia: Ac-a-tub, basal bodies: Centrin4—REP, actin: 
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phalloidin-488. Embryos/cells analysed: uninjected (4/7), cp110A3'UTR 
(6/10). b, The cellular basis for ciliation defects in cp110 morphants is probably 
due to the atypical failure of basal bodies to separate from each other, thus they 
appear to be aggregated in clusters of cp110-deficient MCCs. Nevertheless, 
Centrin4-RFP incorporation or apical localization of basal bodies is not 
affected in cp110 morphants. Basal bodies, Centrin4—RFP; actin: phalloidin- 
488. Total numbers of embryos/cells analysed: un-injected (2/3), cp110 MO 
(5/8). Embryos were derived from at least two females and independent 
fertilizations per Xenopus experiment. 
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Extended Data Table 1 | Candidate miR-34/449 targets with poten- 
tial roles in MCC differentiation and/or ciliation in respiratory 


epithelia 


Gene Symbol__Gene Name 


Ank3 
Atp2b4 
Aurka 
Aurkb 
Ccedc39 
Ccdc40 
Cena2 
Cenb1 
Cend2 
Cone2 
Ccp110 
Cde25a 
Cdc6 
Cdea7! 
Cdk5rap2 
Cep152 
Cep63 
Cep97 
Chek1 
Daam1 
Dnah5 
Dnali1 
Dtl 
Dzip1 
Fat4 
Fgfr1 
Foxg1 
Foxj1 
Hdac6 
Hook3 
Ift27 
Itch 
Jag1 
Kif24 
Lef1 
Mapt 
Met 
Myb 
Myh9 
Pacs1 
Pdgfra 
Pofut1 
Rdh11 
Rfx3 
Rrm2 
Shank3 
Six3 
Skp2 
Stat6 
Stil 
Stk36 
Tmem107 
Tppp 
Tsc2 
Tte26 
THl3 
Xpnpep3 


ankyrin 3, epithelial 

ATPase, Ca++ transporting, plasma membrane 4 
aurora kinase A 

aurora kinase B 

coiled-coil domain containing 39 

coiled-coil domain containing 40 

cyclin A2 

cyclin B1 

cyclin D2 

cyclin E2 

centriolar coiled coil protein 110 

cell division cycle 25A 

cell division cycle 6 

cell division cycle associated 7 like 

CDKS regulatory subunit associated protein 2 
centrosomal protein 152 

centrosomal protein 63 

centrosomal protein 97 

checkpoint kinase 1 

dishevelled associated activator of morphogenesis 1 
dynein, axonemal, heavy chain 5 

dynein, axonemal, light intermediate polypeptide 1 
denticleless homolog (Drosophila) 

DAZ interacting protein 1 

FAT tumor suppressor homolog 4 (Drosophila) 
fibroblast growth factor receptor 

forkhead box G1 

forkhead box J1 

histone deacetylase 6 

hook homolog 3 (Drosophila) 

intraflagellar transport 27 

itchy, E3 ubiquitin protein ligase 

jagged 1 

kinesin family member 24 

lymphoid enhancer binding factor 1 
microtubule-associated protein tau 

met proto-oncogene 

myeloblastosis oncogene 

myosin, heavy polypeptide 9, non-muscle 
phosphofurin acidic cluster sorting protein 1 
platelet derived growth factor receptor, alpha polypeptide 
protein O-fucosyltransferase 1 

retinol dehydrogenase 11 

regulatory factor X, 3 

ribonucleotide reductase M2 

SH3/ankyrin domain gene 3 

sine oculis-related homeobox 3 

S-phase kinase-associated protein 2 (p45) 
signal transducer and activator of transcription 6 
Scl/Tal1 interrupting locus 

serine/threonine kinase 36 

transmembrane protein 107 

tubulin polymerization promoting protein 
tuberous sclerosis 2 

tetratricopeptide repeat domain 26 

tubulin tyrosine ligase-like family, member 3 
X-prolyl aminopeptidase (aminopeptidase P) 3, putative 
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Crystal structure of the human glucose 


transporter GLUTI1 


Dong Deng!*3*, Chao Xu'?**, Pengcheng Sun*, Jianping Wu!??*, Chuangye Yan'”, Mingxu Hu? & Nieng Yan! 


The glucose transporter GLUTI catalyses facilitative diffusion of glucose into erythrocytes and is responsible for glucose 
supply to the brain and other organs. Dysfunctional mutations may lead to GLUTI deficiency syndrome, whereas over- 
expression of GLUT] is a prognostic indicator for cancer. Despite decades of investigation, the structure of GLUT] remains 
unknown. Here we report the crystal structure of human GLUT] at 3.2 A resolution. The full-length protein, which has a 
canonical major facilitator superfamily fold, is captured in an inward-open conformation. This structure allows accurate 
mapping and potential mechanistic interpretation of disease-associated mutations in GLUT1. Structure-based analysis of 
these mutations provides an insight into the alternating access mechanism of GLUT1 and other members of the sugar 
porter subfamily. Structural comparison of the uniporter GLUT1 with its bacterial homologue XylE, a proton-coupled 
xylose symporter, allows examination of the transport mechanisms of both passive facilitators and active transporters. 


GLUT1, encoded by SLC2A1, mediates the basal-level cellular uptake 
of glucose into many tissues. In particular, it is responsible for constant 
uptake of glucose—maintained at approximately 5 mM concentration 
in blood—into erythrocytes through facilitative diffusion’ ’. GLUT] in 
endothelial cells of the blood—tissue barrier has an essential role for glu- 
cose supply to the brain and other organs**. 

Inactivating mutations of GLUT], resulting in compromised trans- 
port activities for glucose, are associated with diseases as a result of lack 
of energy supply to the brain’. GLUT1 deficiency syndrome (also known 
as De Vivo syndrome) is characterized by a spectrum of symptoms in- 
cluding early-onset seizures, microcephaly and retarded development* ”. 
Cancer cells require enhanced glucose supply in part due to the less ef- 
ficient energy production through anaerobic glycolysis (the Warburg 
effect). Elevated expression levels of GLUT1 have been observed in 
several cancer types, identifying GLUT] as an important prognostic indi- 
cator for tumorigenesis'*’’. Because of its fundamental physiological 
and pathophysiological significance, GLUT1 has been a focus for func- 
tional study and structural determination” ”. 

GLUT] belongs to the sugar porter subfamily of the major facilitator 
superfamily (MFS), one of the largest and most ubiquitous secondary 
transporter superfamilies****. MFS transporters share a conserved core 
fold that comprises 12 transmembrane segments organized into two dis- 
cretely folded domains, namely the amino- and carboxy-terminal domains. 
Within each domain, the six consecutive transmembrane segments are 
folded into a pair of ‘3+3’ inverted repeats*””°. Mounting experimental 
evidence indicates that the three-helix bundle may represent the basic 
structural and functional unit”*°. All MFS transporters are thought to 
transport substrate using the alternating access mechanism”, whereby 
the substrate-binding site is alternately accessible from either side of the 
membrane through conformational changes of the transporters. 

Structures of bacterial GLUT1 homologues—the D-xylose:H* sympor- 
ter XylE (refs 28, 29) from Escherichia coli and the glucose:H* symporter 
GlcP (ref. 30) from Staphylococcus epidermidis—have been reported. 
Notably, the structures of XylE bound to D-xylose or D-glucose enabled 
homology-based modelling of GLUT1 (ref. 28). However, both XylE 


and GlcP are proton-driven symporters whereas GLUT] is a uniporter 
that catalyses the translocation of glucose down its concentration gra- 
dient across the membrane. Atomic structure of human GLUT] is essen- 
tial for understanding its transport and disease mechanisms. 


Structure determination of GLUT1 


Details of subcloning, overexpression and purification of GLUT1 can 
be found in Methods. Full-length human GLUT] (residues 1-492) with 
a C-terminal 10 histidine tag was overexpressed in High Five insect 
cells infected with baculoviruses. To eliminate heterogeneity caused by 
glycosylation, we introduced a single point mutation (N45T). We tried 
but were unable to crystallize GLUT1(N45T). We reasoned that GLUT1, 
a transporter with high activity’', may exhibit multiple interchangeable 
conformations that impede crystallization. To solve this potential prob- 
lem, we searched the literature for GLUT] variants that may lock the uni- 
porter in certain conformations. A single missense mutation, E329Q, was 
previously shown to arrest GLUT] in an inward-facing conformation”. 
Weeventually succeeded in the crystallization of full-length GLUT1 con- 
taining the mutations N45T and E329Q. The crystals appeared in the C2 
space group and a few of them diffracted X-rays beyond 3.2 A at syn- 
chrotron radiation source. The structure of GLUT1 was determined using 
molecular replacement with the coordinates of the N and C domains of 
XylE as separate searching models. The final atomic model was refined to 
3.2 A resolution (Extended Data Fig. 1 and Extended Data Table 1). 

In the GLUT]1 structure, residues 9-455 constitute the canonical 
MFS fold whereas the C-terminal segment was invisible probably due 
to its inherent flexibility in this conformation (Fig. 1). The N and C 
domains enclose a cavity that opens to the intracellular side. The 
structure thereby represents an inward-open conformation, consist- 
ent with the predicted state for GLUT1(E329Q)*. Notably, the N and 
C domains are connected by an intracellular helical bundle (ICH) 
which comprises four short «-helices (Fig. 1). Intriguingly, the ICH 
domain was also observed in the structures of the sugar porter mem- 
bers XylE and GlcP, but not in other MFS transporters, suggesting that 
ICH may be a unique feature of the sugar porter subfamily”. 


1State Key Laboratory of Bio- membrane and Membrane Biotechnology, Tsinghua University, Beijing 100084, China. @Center for Structural Biology, School of Life Sciences and School of Medicine, Tsinghua 
University, Beijing 100084, China. ?Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing 100084, China. 


*These authors contributed equally to this work. 
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Extracelluar 


Central cavity 


Figure 1 | Overall structure of the human glucose transporter GLUT1. The 
structure of full-length human GLUT] containing two point mutations (N45T, 
E329Q) was determined in an inward-open conformation. The side and 
cytoplasmic views are shown. The corresponding transmembrane segments in 
the four 3-helix repeats are coloured the same. The extracellular and 
intracellular helices are coloured blue and orange, respectively. A slab of cut- 
open view of the surface electrostatic potential, which was calculated with 
PyMol”®, is shown on the right to facilitate visualization of the inward-facing 
cavity. IC indicates intracellular helix. All structure figures were prepared with 
PyMol. 


Implications for ligand binding 


During refinement of the atomic model, a disc-shaped electron density 
with a discontinuous tail appeared in the inward-open cavity (Extended 
Data Fig. 2a). Because GLUT1 was purified and crystallized in the pres- 
ence of 0.4% (w/v) n-nonyl-B-D-glucopyranoside (B-NG), the observed 
electron density may come froma bound B-NG molecule, the sugar moi- 
ety of which happens to be the substrate for GLUT1. Indeed, a B- NG mol- 
ecule fits perfectly into the electron density, with the D-glucopyranoside 
bound to the C domain of GLUT] at approximately the centre of the 
membrane, whereas the aliphatic tail points to the intracellular side 
along the cavity (Extended Data Fig. 2b, c). 

The C domains of GLUT1 and XylE can be superimposed with a 
root-mean-squared deviation of 1.244 A over 138 aligned Ca atoms. On 
domain superimposition, D-glucopyranoside of the bound B-NG over- 
laps with D-glucose (Extended Data Fig. 2d). Reminiscent of the D-glucose 
coordination by XylE (ref. 28), the D-glucopyranoside of B-NG is hydrogen- 
bonded (H-bonded) to the surrounding polar residues in the C domain 
(Extended Data Fig. 2e). The similar coordination of the D-glucopyranoside 
of B-NG by inward-open GLUT] and D-glucose by outward-facing XylE 
supports the notion of a single sugar-binding site that is alternately ac- 
cessed from either side of the membrane’. In the outward-facing, 
partly occluded XylE, D-glucose is coordinated by residues mainly from 
the C domain”. In the inward-open GLUT], residues from the N do- 
main are not involved in ligand binding. These observations indicate 
that the C domain may provide the primary substrate-binding site 
whereas the relative motion of the N domain results in alternating access. 


Mapping of disease-related mutations 


The detailed structural information on GLUT1 provides a molecular 
basis for mechanistic interpretation of disease-derived inactivating mu- 
tations (Extended Data Table 2). These mutations were identified from 
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patients with GLUT] deficiency syndrome, early-onset absence epi- 
lepsy, paroxysmal exercise-induced dyskinesia, and familial idiopathic 
generalized epilepsy'***. Most of the mutations target polar or charged 
amino acids, which probably have a functional, rather than structural, 
role. It is possible that mutations that disrupt the structural integrity of 
GLUT1 cause complete loss of function and hence lethality’®. 

The disease-derived mutations largely map to three clusters in GLUT1 
(Fig. 2). Cluster I comprises residues that are responsible for substrate 
binding (Fig. 2 and Extended Data Fig. 2e); cluster IT is located at the inter- 
face between the transmembrane domain and the ICH domain (Figs 2 
and 3); and cluster III comprises residues lining the transport path, which 
are mostly committed to interactions between the N and C domains on 
the extracellular side (Figs 2 and 4 and Extended Data Fig. 3a). Muta- 
tions of the substrate-binding residues may lead to compromised recog- 
nition of D-glucose and markedly reduced transport activity***. Below, 
we focus on the analysis of clusters II and II which may facilitate 
understanding of the transport mechanism of GLUT1 and other sugar 
porter members. 


Critical role of the ICH domain 


In a simplified model, alternating access of the MFS transporters can 
be achieved through rigid-body rotation of the N and C domains”***. 
Structural comparison of the inward-open GLUT] and the outward- 
facing XylE reveals an approximately 16° concentric rotation of the two 
domains (Fig. 3a). GLUT1 and XylE share three intracellular helices (Ex- 
tended Data Fig. 4a). Intracellular helix 1 (IC1) and IC2 appear to main- 
tain a defined conformation with respect to the N domain, whereas IC3 
is pulled towards the C domain during the inter-domain rotation from 
outward-facing to inward-open (Fig. 3b and Extended Data Fig. 4b, c). 
To facilitate structural comparison, we will use the GLUT1 residue num- 
bers to annotate the conserved residues in XylE (Fig. 3c and Extended 
Data Fig. 5). 

Nearly half of the mutations derived from GLUT1 deficiency syn- 
drome map to residues that mediate the inter-domain contacts on the 
cytosolic side (Fig. 3c and Extended Data Fig. 5). In the outward-facing 
XylE, several conserved GLUT1-deficiency-sydrome-associated resi- 
dues contribute to the interactions between the N and C domains along 
the intracellular side of the membrane, thereby stabilizing the outward- 
facing conformation (Figs 2 and 3c). The guanidinium group of Arg 153 
at the intracellular end of transmembrane 5 (TM5) forms two hydrogen 
bonds with the carbonyl oxygen of Lys 458 which is on the loop pre- 
ceding ICS. Arg 212 on IC2 interacts with Glu 461 on IC5. Most notably, 
Asp 329 seems to have a critical role in the maintenance of the outward- 
facing conformation of XylE with its carboxylate group accepting two 


Figure 2 | Structural mapping of disease-derived mutations in GLUT1. The 
residues for which mutations have been identified in patients with GLUT1 
deficiency syndrome and other symptoms are coloured magenta. The orange 
circle and the grey and cyan shades indicate the three major clusters of residues 
for which mutations are associated with diseases. The N, C and ICH domains 
are coloured green, blue and yellow, respectively. The same colour scheme is 
used in all subsequent figures. Details of the mutations can be found in 
Extended Data Table 2. 


©2014 Macmillan Publishers Limited. All rights reserved 


XylE with residue numbering 
consistent with GLUT1 


hydrogen bonds from the backbone amide groups of Gly 154 and Lys 155. 
These hydrogen bonds are further stabilized by a pair of charge-stabi- 
lized hydrogen bonds between Asp 329 and Arg 333. 

Obviously, none of these interactions observed in XylE would exist 
in the inward-open conformation of GLUT] (Fig. 3c). Asp 329 in XylE 
is replaced by Glu in GLUT1. Assuming a similar set of interactions in 
GLUT1 as in XylE, the mutation E329Q is predicted to weaken or 
abolish the hydrogen bonds with the amide groups of Gly 154 and 
Ala 155. Consequently, GLUT 1(E329Q) may favour the inward-facing 
conformation as it probably encounters an increased energy barrier 
during the outward transition. Along this line, we predict that substi- 
tution of other residues in this interface may weaken the interactions 
between the N and C domains on the intracellular side, thus impeding 
the transport cycle by disfavouring the outward-facing conformation. 
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Figure 3 | The ICH domain serves 
as a latch that tightens the 
intracellular gate. a, Structural 
comparison of the inward-open 
GLUT1 to its E. coli homologue XylE 
(PDB accession code 4GC0)**. 

b, Structural overlay of GLUT1 and 
XylE using their respective N 
domains. For visual clarity, GLUT] is 
displayed in three colours and XylE is 
coloured grey. c, Analysis of the 
disease-related GLUT1 residues 

that mediate the inter-domain 
interactions on the intracellular side. 
Left: positioning and interactions of 
the disease-related residues on the 
intracellular side of the inward-open 
GLUT1. Right: the interaction 
network of the corresponding 
residues in the outward-facing XylE. 
The residues shown in this panel are 
conserved between GLUT1 and 
XylE, with the disease-related 
residues coloured and labelled 
magenta. The XylE residues are 
numbered according to their 
counterparts in GLUT1. Hydrogen 
bonds are represented by dashed lines. 


Analysis of the cluster II mutations suggests that the ICH domain, 
which represents a unique feature of the sugar porter subfamily, may 
function as a latch to secure closure of the intracellular gate in the 
outward-facing conformation. Supporting this notion, the interactions 
between the N and C domains in the transmembrane region of the outward- 
facing XylE are mediated by a limited number of hydrophobic residues 
(Extended Data Fig. 3b). Therefore, the inter-domain contacts invol- 
ving the charged or polar ICH residues are likely to have a critical role 
to close the transporter on the intracellular side during an alternating 
access cycle. 


The extracellular gate 


In the inward-open conformation of GLUT1, TM] and TM7 contact each 
other on the extracellular side, representing the major constituents of 


Figure 4 | Analysis of the 
extracellular gate of the inward- 
open GLUT1. a, TM1 and TM7 
represent the major constituents of 
the extracellular gate in the inward- 
open GLUT1. b, Several disease- 
related residues mediate the 
interactions between the N and C 
domains on the extracellular side of 
the inward-open GLUT. A close-up 
view of the hydrogen bond network 
is shown in the inset. c, Comparison 
of the local interactions for the 
disease-related residue Arg 126 in 
GLUT1 and the corresponding residue 
Arg 133 in XylE. The residues of XylE 
are labelled according to their 
counterparts in GLUT1 whereas 
their native numbers are indicated in 
brackets. d, Arg 126 on TM4 of GLUT1 
may form a cation—n interaction with 
Tyr 292 on TM7 of the C domain, 
contributing to the affinity of the 
extracellular gate. The structures of 
GLUT1 and XylE are superimposed 
using their respective N domains. 
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the extracellular gate (Fig. 4a). It is noteworthy that TM7 of GLUT1 con- 
tains one extra kink close to its extracellular end compared to that of the 
outward-facing XylE (Extended Data Fig. 4d). Notably, the mutations 
N34I, N34S, N34Y, S294P, T295M and T310L, affecting four amino acids, 
were isolated from GLUT] deficiency syndrome patients (Extended Data 
Table 2). Asn 34 on TM] appears to be an organizing centre that med- 
iates hydrogen bonds to Ser 294 and Thr 295 on TM7 and Thr 310 on 
TM8 (Fig. 4b). Together, this network of hydrogen bonds between the 
N and C domains occludes the ligand-binding site from the extracel- 
lular environment in the inward-facing conformation of GLUT1. Note 
that the residues Tyr 292, Ser 294 and Thr 295, which interact with the 
N domain, are located surrounding the discontinuous helical segment 
of TM7 (Fig. 4b). 

Among the disease-related residues, Arg 126 is of particular interest, 
because its counterpart in XylE, Arg 133, forms a network of hydrogen 
bonds with Asp 27, an essential residue for proton coupling in the 
outward-facing XylE (ref. 45). Asp 27 in XylE is replaced by Asn 29 
in GLUT1. Notably, Arg 126 does not interact with the side chain of 
Asn 29 in the inward-open GLUT1 (Fig. 4c). More importantly, the 
guanidinium group of Arg 126 is approximately 6 A away from the 
benzene ring of Tyr 292 on TMZ, probably forming a cation—n inter- 
action that further strengthens the extracellular gate in the inward- 
open GLUT] (Fig. 4d). 


Discussion 

Structural resolution of the human glucose transporter GLUT1 serves 
as a framework for understanding its functional mechanism. Compari- 
son of the proton-independent uniporter GLUT1 and its bacterial 
homologue, the proton symporter XylE, the structures of which were 
obtained in distinct conformations*”’, allows mechanistic analysis of 
facilitative diffusion versus active transport. Conclusions derived from 
such analysis may be generally applicable to other uniporters and proton- 
coupled symporters. On the basis of our structural analysis and published 
biochemical data, we propose a working model for GLUT1 (Fig. 5). 
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Figure 5 | A working model for GLUT1. Shown here are the predicted 
conformations—outward-open, ligand-bound and occluded, inward-open, 
and ligand-free and occluded—required for a complete transport cycle 
according to the alternating access model. The inward-open conformation of 
GLUT1 is reported in this study. The ligand-bound, occluded conformation 
and the ligand-free, occluded one are predicted from two XylE structures in the 
outward-facing, partly occluded and ligand-bound state*’, and the inward- 
facing, occluded state”, respectively. The outward-open structure remains to be 
captured. The ICH domain is illustrated as a latch that strengthens the 
intracellular gate in the outward-facing conformations. The extracellular gate 
comprises a few residues from TM1, TM4 and TM7 that are illustrated by the 
red brick in the ‘inward open’ cartoon. 
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Uniporters only catalyse the translocation of substrate down its con- 
centration gradient. The essential elements for a uniporter to complete 
a transport cycle are the conformational switches. In the case of GLUT1, 
because of the extensive interactions between TMD and ICH, the ligand- 
free protein may prefer an outward-open conformation. Substrate bind- 
ing at the primary site on the C domain may provide extra contacts 
with the N domain, inducing closure of the N and C domains on the 
extracellular side and leading to rearrangement of interactions on both 
sides of the bound substrate. When the binding affinity between the N 
and C domains on the extracellular side (Fig. 4) surpasses that of the 
intracellular side (Fig. 3c), the protein may switch to the inward-open 
conformation. Once GLUT1 adopts the inward-open conformation, 
the substrate is exposed to a low concentration environment; then the 
equilibrium is shifted towards substrate dissociation. The substrate-free 
uniporter is able to return to the outward-facing state. This model pre- 
dicts that substrate-free uniporters have a preferred open conformation, 
and substrate association and dissociation drive the conformational 
switches. 

Proton symporters exploit the transmembrane proton gradient (also 
known as the proton motive force’) to drive the ‘uphill’ translocation 
of substrate against its concentration gradient. The translocation of 
proton and substrate are obligatorily coupled. Notably, Asp 27 of XylE, 
corresponding to Asp 32 in GalP (ref. 48) and Asp 22 in GlcP (ref. 30), 
has a critical role in proton coupling. The mutation D27N in XylE or 
D22N in GlcP led to abrogation of proton-dependent active symport, 
but not the counterflow activity’. The structural consequence of pro- 
tonation of Asp is thought to be similar to Asn substitution. Therefore 
the structure of GLUT1, where Asn 29 is the counterpart of the critical 
Asp in the bacterial homologues, provides an opportunity to under- 
stand the protonated state of the proton symporters. 

Protonation of the key Asp residue releases the counterpart of Arg 126 
in GLUT] which is subsequently able to form a cation-n interaction 
with the aromatic residue in the C domain, an event that probably trig- 
gers the outward to inward transition (Fig. 4c, d). In this partial process, 
a uniporter can be regarded as a permanently protonated symporter. 
However, unlike uniporters which upon substrate release are able to 
return from inward- to outward-facing, a substrate-released symporter 
cannot undergo the conformational switch until it is deprotonated, and 
vice versa (that is, deprotonated symporter cannot go back to outward- 
facing unless the substrate is dissociated). Detailed mechanistic under- 
standing of the transport mechanisms of uniporters and symporters 
awaits additional biochemical and biophysical investigations. In par- 
ticular, the coupling mechanism of deprotonation, substrate release into 
a high-concentration environment, and the conformational switch from 
inward-facing to outward-facing of symporters remains to be elucidated. 

In summary, we report the long-sought-after crystal structure of the 
human glucose transporter GLUT1, which allows accurate mapping 
and potential mechanistic interpretation of disease-related mutations. 
Structural comparison of GLUT1 with XylE provides a foundation to 
understand the transport mechanisms of facilitative uniporters and 
proton-driven symporters. The structures of GLUT1, XylE and GlcP 
provide important insight into structural and mechanistic understand- 
ing of the sugar porter family whose members are involved in a wide 
variety of biological processes as well as biotechnological applications”. 
The structure also serves as a guiding principle for the development of 
potential therapeutic agents that target GLUT 1 and other physiologic- 
ally important MFS sugar transporters. 


METHODS SUMMARY 


To obtain a large quantity of GLUT1 protein suitable for crystallization trials, we 
exploited multiple recombinant expression systems, including E. coli, several yeast 
species, baculovirus-infected insect cells, and mammalian expression systems. On 
the basis of both protein yield and solution behaviour, insect cells were chosen for 
large-scale production of GLUT1. Treatment of the purified protein by deglyco- 
sylase resulted in shift to a lower band on SDS-PAGE by western blot, indicating 
post-translational glycosylation of GLUT1 during expression. To eliminate poten- 
tial heterogeneity caused by glycosylation, we introduced a single point mutation, 
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N45T, in GLUTI. A second point mutation, E329Q, was introduced in the hope of 
stabilizing GLUT! in a certain conformation. The full-length human GLUT1 
variant (residues 1-492, N45T and E329Q) with a C-terminal 10X His tag was 
overexpressed in the High Five insect cells infected with baculoviruses. Forty-eight 
hours after viral infection, the cells were collected and disrupted using dounce 
homogenizer. The protein was extracted from membrane fraction with 2% (w/v) 
n-dodecyl-B-b-maltoside (DDM) at 4 °C for 2h and purified with nickel affinity 
resin (Ni-NTA) in the presence of 0.05% (w/v) DDM and 5% glycerol (w/v). The 
protein eluted from Ni-NTA was concentrated to about 10 mg ml before apply- 
ing to size-exclusion chromatography (Superdex-200; GE Healthcare) in the buffer 
containing 25 mM MES pH 6.0, 150 mM NaCl, 5% glycerol, and 0.4% (w/v) B-NG. 
The peak fractions were collected for crystallization trials. Crystals appeared through 
the hanging-drop vapour diffusion method at 4 °C in the well buffer containing 
30% (w/v) PEG400, 0.1 M MES pH 6.0, and 0.1 M MgCl. Data sets were collected 
at Shanghai Synchrotron Radiation Facility (SSRF) beamline BL17U. The structure 
of GLUT was determined using molecular replacement with the coordinates of 
the N and C domains of XylE as separate searching models. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Protein expression and purification. The full-length human GLUT1 complemen- 
tary DNA was subcloned into the NdeI and Xhol sites of pFastBacl (Invitrogen) with 
a C-terminal 10X His tag. All GLUT1 mutants were generated with a standard PCR- 
based strategy. The recombinant GLUT1 was expressed using the pFastBac baculo- 
virus system (Invitrogen). Briefly, bacmid DNAs were generated in DH10Bac cells, 
and the resulting baculoviruses were generated and amplified in Sf insect cells 
(Invitrogen). GLUT1 was overexpressed in High Five insect cells (Invitrogen) grown 
in the SIM HF medium (Sino Biological Inc.). Forty-eight hours after viral infection, 
the cells were collected and homogenized in the buffer containing 25 mM Tris pH 8.0 
and 150 mM NaCl. The cells were disrupted using dounce homogenizer for 80 cycles 
on ice. Cell debris was removed by low-speed centrifugation for 10 min. The super- 
natant was applied to ultracentrifugation at 150,000g for 1 h. The membrane fraction 
was solubilized in TBS buffer (25 mM Tris pH 8.0 and 150 mM NaCl) containing 
protease inhibitors (aprotinin at 0.8 |1M, pepstatin at 2 1M, leupeptin at 5 pg ml‘) 
and 2% (w/v) n-dodecyl-B-p-maltoside (DDM, Anatrace) at 4°C for 2h. After a 
further step of ultracentrifugation at 150,000g for 30 min, the detergent-soluble frac- 
tion was harvested and incubated with nickel affinity resin (Ni-NTA, Qiagen) at 4 °C 
for 30 min. The resin was rinsed with the buffer containing 25 mM MES pH 6.0, 
150 mM NaCl, 30 mM imidazole, 5% glycerol (w/v), and 0.05% (w/v) DDM for 
three times. The protein was eluted from the affinity resin with the wash buffer plus 
300 mM imidazole. The protein was concentrated to about 10 mg ml before further 
purification by size-exclusion chromatography (Superdex-200; GE Healthcare) in 
the buffer containing 25 mM MES pH 6.0, 150 mM NaCl, 5% glycerol, and selected 
detergents. The peak fractions were collected and flash frozen in liquid nitrogen for 
crystallization. 

Crystallization. Extensive crystallization trials were performed for GLUT1 pro- 
teins purified in various detergents and in lipidic cubic phase. Most of the efforts 
failed to yield crystals. Finally, the full-length GLUT1 containing two point muta- 
tions N45T and E329Q purified in the presence of 0.4% (w/v) B-NG gave rise to 
crystals through hanging-drop vapour diffusion method in the well buffer contain- 
ing 30% (w/v) PEG400, 0.1 M MES pH 6.0 and 0.1 M MgCl, at 4 °C. The crystals 


appeared after 2 days and reached full size in 5 to 7 days. A few crystals diffracted 
X-rays to beyond 3.2 A at Shanghai Synchrotron Radiation Facility (SSRF) beam- 
line BL17U. 

Data collection and structure determination. All data sets were collected at 
SSRE beamline BL17U and processed with the HKL2000 packages”. Further 
processing was carried out with programs from the CCP4 suite’. Data collection 
and structure refinement statistics are summarized in Extended Data Table 1. The 
phase was solved by molecular replacement using PHASER® with XylE (PDB 
code 4GBY) as searching model. The model was first modified by CHAINSAW™, 
then the N domain and C domain were taken separately as input ensembles for 
PHASER. The model was further rebuilt in COOT® and refined with PHENIX®. 
The sequence docking was further aided by sequence alignment with XylE. 
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space group: C2 
Extended Data Figure 1 | Structure determination of GLUT1. a, The 2F, the space group C2. Each GLUT1 molecule is shown as rainbow-coloured 


— F, electron density map. The stereo-view map for one representative slab, ribbon, blue and red for the N and C termini, respectively. 
shown as cyan mesh, is contoured at 1.00. b, The crystal packing of GLUT] in 
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Extended Data Figure 2 | One B-NG molecule occupies the substrate- 
binding site of the inward-open GLUT1. a, The ‘omit’ electron density 
observed in the inward-open cavity of GLUT1. The electron density, shown as 
magenta mesh, is contoured at 3.00. The N, C and ICH domains are coloured 
green, blue and yellow, respectively. b, A B-NG molecule fits well into the 
electron density inside the cavity. The 2F, — F, electron density map (cyan 
mesh) for the B-NG molecule is contoured at 1.0c. c, The overall GLUT1 
structure with the bound B-NG molecule. B-NG is represented by white 
spheres. d, The coordination of the sugar moiety of B-NG by GLUT] is similar 
to the binding of D-glucose by XylE. The structures of GLUT1 (blue) and XylE 


extracellular *®: _ 
y ) = ‘ 
< 


we nc 


Ae 


(cyan) are superimposed relative to their respective C domains. The ligands are 
shown in stick representation. Despite the similarity between D-glucose and 
B-NG, we cannot exclude the possibility that presence of the aliphatic tail of 
B-NG in GLUT] may subtly affect positioning of the sugar moiety compared to 
D-glucose. e, Coordination of the D-glucopyranoside of S-NG by GLUT1. The 
D-glucopyranoside of §-NG is hydrogen-bonded to the surrounding polar 
residues in the C domain, including Gln 282/Gln 283/Asn 288 from TM7, 
Asn 317 from TM8, and Asn 415 from TM11. The residues whose mutations 
are associated with GLUT1 deficiency syndrome are labelled in magenta. 
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Extended Data Figure 3 | Interactions between the N and C domains mutations are associated with GLUT1 deficiency syndrome are labelled in 
observed in the inward-open GLUT1 and the outward-facing XylE. a, In magenta. b, In XylE, there are only limited interactions between the N and C 
GLUT1, the inter-domain contacts, mainly on the extracellular side, include domains at the transmembrane region. 

both van der Waals interactions and hydrogen bonds. Residues whose 
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GLUT1 XylE 


Extended Data Figure 4 | Conformational differences between GLUT1 and __ conformation. b, c, Structural superimpositions of GLUT1 and XyIE relative to 
XylE. a, Structural comparison of the inward-open GLUT] and the outward- __ their respective N domains (b) and C domains (c). A detailed analysis can be 


facing, partly occluded, ligand-bound XylE. Similar intracellular views are found in Figs 3 and 4. d, Conformational differences of TM7 between GLUT1 
shown for GLUT and XylE. Note that the C-terminal helix IC5, which was and XylE. Compared to that in XylE, the extracellular segment of TM7 in 
referred to as IC4 in a previous study of XylE (ref. 28), is invisible in the GLUT1 is further bent away from the transport path. 


structure of GLUT1 probably due to its inherent flexibility in this 
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GLUT1 ------------ MEPSSKKLTGRLMLAVGGAVLGSLO NAPQKVIEEFYNOTWVHRYG------------------------------- 53 
GLUT2 -------------- MTEDKVTGTLVFTVITAVLGSFQ NAPQQVIISHYRHVLGVPLDDRKAINNYVINSTDELPTISYSMNPKPTPWA 82 
GLUT3 -------------- MGTQKVTPALI FAITVATIGSFQ APEKI IKEF INKTLTDKGN----------------—---------—---- 51 
GLUT4 MPSGFQQIGSEDGEPPQQRVIGTLVLAVFSAVLGSLQ IAPOKVIEQSYNETWLGRQGP------------------------------ 66 
XylE -------------- MNTOYNSSYIFSITLVATLGGLL SG----TVESLNTVEVAPQN------------------------------- 47 
GLUT1 ---ESILPTTLTTLWSLS EEE oe Ee FVSAVLMGFSKLGKS--------------- FEMLILGRFIIGV 131 
GLUT2 EEETVAAAQLITMLWSLS = GMTASFFGGWLGDTLGRIKAMLVANILSLVGALLMGFSKLGPS--------------- HILIIAGRSISGL 163 
GLUT3 ---APPSEVLLTSLWSLS GMIGSF SVGLEVNRFGRRNSMLIVNLLAVTGGCFMGLCKVAKS--------------- VEMLILGRLVIGL 129 
GLUT4 EGPSSIPPGTLTTLWALS ze GMISSFLIGIISQWL LVNNVLAVLGGSLMGLANAAAS-----—----—---- YEMLILGRFLIGA 147 
XylE ----- LSESAANSLLGF! CII GGALGGYCSNRFGRRDSLKIAAVLFFISGVGSAWPELGFTSINPDNTVPVYLAGYVPEFVIYRIIGGI 138 
ae — —— GEES caer 

GLUT1 TTGFVPMYVGEVS PTALRGALGTLH@LGIVVGILIAQVFGLDSIMG------ NKDLWPLLLSIIFIPALLOCIVLPFCPESPRFLLINRNEE 221 
GLUT2 ISGLVPMY IGEIAPTALRGALGTFHeLAIVIGILISQIIGLEFILG------ NYDLWHILLGLSGVRAILOSLLLFFC YIKLDEE 253 
GLUT3 CTGFVPMYIGEISPTALRGAFGTLN@LGIVVGILVAQIFGLEFILG------ SEELWPLLLGFTILPATLOSAALPFC FLLINRKEE 219 
GLUT4 TSGLVPMYVGEIAPTHLRGALGTLNeLAIVIGILIAQVLGLESLLG------ TASLWPLLLGLTVLPALLOLVLLPFC YIIOQNLE 237 
SMLS FLMLLYT SRGKQE 234 

Ss ADVTHDLQEMKEESROMMREKKVT ILELERSPAYROPILIAVVLOLS@0! VOO--PVY 315 
LRGYDDVTKDINEMRKEREEASSEQKVSIIQLFTNSSYROQPILVALMLHVA eel TAGISK--PVY. 347 
QILORLWGTQDVSQDIQEMKDESARMSQEKOVTVLELFRVSSYROPIIISIVLOLS@@] DAGVQE--PLY: 313 
LTGWADVSGVLAELKDEKRKLERERPLSLLOLLGSRTHRQPLIIAVVLOLS eel TAGVGQ PAY. 331 
GILRKIMGNTLATQAVOQEIKHSLDHG--RKTGGRLLMFGVG----VIVIGVMLSI Feel LGASTDIALLO 323 

GLUT1 VNTAFTVVSLEVVE TLHLI GCATLMTIALALLEQLPWMSYLSIVAIFGFVAFFEVGPGPI PjFIVAELFSQGPRPAATAVAGFSj 411 
GLUT2 VNMVFTAVSVFL SLFLIGMSGMFVCAI FMSVGLVLLNKFSWMSYVSMIAI FLFVSFFEIGPGPIPyJFMVAEFFSQGPRP FSN) 443 
GLUT3 VNTIFTVVSLFL TLHMIGLGGMAFCSTLMTVSLLLKDNYNGMSFVCI GAILVEVAFFEIGPGPIPi/F 1VAELFSOGPRP. GCs) 409 
GLUT4 VNIVFTLVSVLL TLHLL ICGCAT LMTVALLLLERVPAMSYVSIVAIFGFVAFFEIGPGPI P/F 1VAELFSQGPRP. GFSN) 427 
XylE INLTFTVLAIMTVDKFGRKPLOII IG--MFSLGTAFYTQAPGI--VALLSMLF SI LSEIFPNAIRGKALAI 415 

a —— 

GLUT1 WTISNFIVGMCFoYV------- EQLCGPYVFIIFTVLLVLFFIFTYF KGRTFDEIASGFR-OG-GASQSDKTPEELFHPLGADSQV 492 
GLUT2 FIVALCFQYI------- ADFCGPYVFFLFAGVLLAFTLETFF KGKSFEEIAAEFQKKSGSAHRP--KAAVEMKFLGATETV 524 
GLUT3 WTISNFLVGLLFPSA------- AHYLGAYVFIIFTGFLITFLAFTFF RGRTFEDITRAFEGQAHGADRSGKDGVMEMNSIEPAKETTINV 496 
GLUT4 WISNFIIGMGFOYV------- AEAMGPYVFLLFAVLLLGFFIFTFL RGRTFDOISAAFH-RTPSLLEQEVKPSTELEYLGPDEND 509 
XylE YFVSWTF PMMDKNSWLVAHFHNGF'S YWIYGCMGVLAALEMWK KGKTLEELEALWEPET------—— KKTOQOTATL 491 


Extended Data Figure 5 | Sequence alignment of GLUT1-4 with XylE. 
Secondary structural elements of GLUT1 are indicated above the sequence 
alignment. Invariant and highly conserved amino acids are shaded yellow and 
grey, respectively. The conserved sugar porter family signature motifs are 


underscored with red lines. The residues that are hydrogen-bonded to 
D-glucose in XylE are shaded red. The GLUT! residues whose mutations were 
found in GLUT] deficiency syndrome are indicated by black circles above. The 
sequences were aligned with ClustalW. 
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Extended Data Table 1 | Statistics of data collection and refinement for GLUT1 


Protein GLUTI1 
Integration Package HKL2000 
Space Group C2 
Unit Cell (A) 120.51, 102.00, 65.60 
Unit Cell (°) 90, 101.27, 90 
Wavelength (A) 0.9791 
Resolution (A) 40~3.20 (3.31~3.20) 
Rares (%) 10.2 (53.8) 
I/sigma 17.6 (3.2) 
Completeness (%) 98.2 (97.2) 
Number of measured reflections 52,880 
Number of unique reflections 12,834 
Redundancy 4.1 (4.2) 
Rwork / Reree (%) 24.14/27.35 
No. atoms 
Protein 3,469 
main chain 1,788 
side chain 1,681 
Others 21 
Average B value (A’) 
Protein 99.13 
main chain 97.56 
side chain 100.80 
Others 82.02 
R.m.s. deviations 
Bonds (A) 0.011 
Angle (°) 1.473 
Ramachandran plot statistics (“%) 
Most favorable 91.7 
Additionally allowed 8.1 
Generously allowed 0.3 
Disallowed 0.0 


Five crystals were used for data collection. Values in parentheses are for the highest resolution shell. 
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Extended Data Table 2 | Summary of disease-related sequence variations of GLUT1 


Variants Disease Variants Disease 
N341°° DS2 R223P"! EIGI2 
N348°” DSI R223W”’ DSI 
N34Y”° DSI R232C" FIG12 
S66F* DSI K256E** DSI 
G91D" DSI AQTST” DS2 
R92W”? DS2 A282-285°8 DS2 
R93W* DS2 Y292YY~’ DSI 
s9sI*! DS2 s294P* DS2 
M96Vv*° DSI T295M*° DSI 
R126C* DSI V303L” DSI 
R126H™ DSI T3101 DS1 
R126L** DSI G3148"! DS2 
G1308*° DSI N317T°? DS2 
E146K°* DS1 $324L"! DS2 
R153C°8 DS1 E329Q*” DS1 
R1I53H* DS2 R333Q” DSI 
A155V”° DSI R333W** DS1 
V1651" DS2 G382D* DSI 
A169°7 DSI A405D*° DSI 
R212C”° DSI R468W”’ DSI 
R212H*° DSI P485L*° DSI 


DS1/2 stands for GLUT1 deficiency syndrome 1 or 2. EIG12 is short for ‘epilepsy, idiopathic generalized 12’. The superscript numbers following each mutation are reference numbers (refs 7, 34, 36-44, 57-60). 
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Dynamically important magnetic fields near 
accreting supermassive black holes 


M. Zamaninasab!, E. Clausen-Brown!, T. Savolainen! & A. Tchekhovskoy** 


Accreting supermassive black holes at the centres of active galaxies often 
produce ‘jets —collimated bipolar outflows of relativistic particles'. Magnetic 
fields probably play a critical role in jet formation’ and in accretion 
disk physics*. A dynamically important magnetic field was recently 
found near the Galactic Centre black hole’. If this is common and if the 
field continues to near the black hole event horizon, disk structures 
will be affected, invalidating assumptions made in standard models**”. 
Here we report that jet magnetic field and accretion disk luminosity are 
tightly correlated over seven orders of magnitude for a sample of 76 
radio-loud active galaxies. We conclude that the jet-launching regions 
of these radio-loud galaxies are threaded by dynamically important 
fields, which will affect the disk properties. These fields obstruct gas 
infall, compress the accretion disk vertically, slow down the disk rota- 
tion by carrying away its angular momentum in an outflow’ and deter- 
mine the directionality of jets. 

General relativistic magnetohydrodynamic simulations find that if 
the accretion disk is threaded with enough poloidal magnetic flux, the 
flux will be transported inwards and accumulate in the central region 
of the disk until the ram pressure of the accreting gas is balanced by the 
magnetic pressure’”. (Poloidal refers to a vector pointing along the radial 
and axial directions in a cylindrical coordinate system, with the axial direc- 
tion aligned with the disk angular momentum axis.) The poloidal flux 
threading the black hole, ®py, then naturally reaches a saturation or 


equilibrium value of ~ 50 (m1 re c) , where M is the mass accretion rate, 


T= GMIc* the black hole gravitational radius, G the gravitational con- 
stant, M the black hole mass, and c the speed of light. The magnetic field 
then dominates the plasma dynamics of the inner disk and modifies the 


disk structure such that it becomes a ‘magnetically arrested disk*°’. 
Simulations of magnetically arrested disks also find that if the black hole’s 


dimensionless spin parameter is a,. 20.5 (a, = 0 refers to a non-rotating 
black hole and a, = 1 a maximally spinning black hole), then highly 
relativistic bipolar jets powered by the black hole’s rotational energy via 
the Blandford-Znajek effect? are ejected”"” with a total power of ~ Mc’. 
Among active galactic nuclei (AGN), only about 10% contain powerful 
radio-emitting jets (that is, are radio-loud AGN)". 

Although we cannot directly measure ®gy, we can observationally 
infer the poloidal magnetic flux threading parsec-scale jets, Pj. which 
by the flux freezing approximation is the same as ®gy for jets produced 
via the Blandford—Znajek effect (Fig. 1). This inference can be per- 
formed via high-angular-resolution radio observations of the so-called 
core-shift effect’? (see Methods), which gives a measure of the jet’s 
co-moving azimuthal magnetic field, B’, = B, /I” (where primes refer 
to jet co-moving frame quantities, and J” is jet bulk Lorentz factor). 
According to standard Blandford—Znajek jet theory, By/B, o a.Rj/ry, 
where R,; is jet cylindrical radius, B, is the poloidal magnetic field and 


1/2 


TH=Tg (1 +(1i- a.) is the black hole event horizon radius. 


Therefore, Din~R; By ocM R,By, and a more detailed derivation at 
one parsec downstream of the black hole yields: 


M 1{B, 
= 34 pe 
Pj, = 1.2 X 10 f (ax) L8; a -| Fe] (1) 


Here f(a.) =a, !ry/ Tg, Bis is the jet’s co-moving frame magnetic field 
measured by the core-shift effect, 0; is the jet opening angle and je, is in 
units of G cm’ (for a full derivation of equation (1), see Methods). 

The amount of magnetic flux predicted to thread a black hole in the 
magnetically arrested disk model can be calculated for a given source if 
its mass and accretion disk luminosity are known. This predicted flux 


Ltt - 


10! 102 10° 104 106 


Z (ry) 
Figure 1 | Diagram of an AGN jet. At left, the large black dot represents the 
black hole, with the disk (seen side-on) extending up and down; z (horizontal 
axis) represents the distance above the disk, in units of Tg. The radio core is the 
bright feature at the upstream end of the parsec-scale jet in the embedded radio 
image and it corresponds to the place in the jet where the optical depth t, = 1 
for an observing frequency, v. The frequency-dependent positional shift of this 
core provides an estimate for the magnetic field at one parsec, B,p., which we 


use to derive the poloidal magnetic flux threading parsec-scale jets, Oj... This, 


10 10” 


by the flux freezing approximation, should be the same as the magnetic 
flux threading the black hole, gy. We compare ®,,, to the predicted 
amount of magnetic flux corresponding to a magnetically arrested disk, 


. 1/2 
50 (a1 te c) oc Li? M, where Lac is the accretion disk luminosity. I” is the jet 


bulk Lorentz factor and characterizes the jet velocity. This figure makes use of 
data from the MOJAVE database (http://www.physics.purdue.edu/astro/ 
mojave/). 


1Max-Planck-Institut flr Radioastronomie, Auf dem Higel 69, 53121 Bonn, Germany. *Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA. 7Department of 
Astronomy and Theoretical Astrophysics Center, University of California, Berkeley, California 94720-3411, USA. 
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scales as 50 (1 te c) oc i M, where L,,, is the accretion disk lumin- 


osity. The disk luminosity enters this relation via the radiative effi- 
ciency 7, where Lace = nM c’; in this work we use 7 = 0.4 (see Methods 
for details). 

We test the magnetically arrested disk model by plotting the values 
of D-, versus i M for a sample of radio-loud AGN with measured 
core-shifts and accretion disk luminosities. If all sources contain mag- 
netically arrested disks, then the data in this plot will display two qual- 
ities: (1) there will bea positive correlation between Pj, and ue M, and 
(2) the data points will be scattered around the theoretical curve defined 


. 1/2 
by the relation ®.,~50 (m1 i c) 


We use measured core-shifts from a sample of 191 radio-loud AG 
and from individual nearby sources'*'*"* (see Extended Data Tables 1 
and 2). We have also searched the literature for available estimates 
of black hole mass for the above-mentioned radio sources and found 
reliable estimates for 76 of them (see Methods). The masses are estimated 
assuming that the dynamics of the broad line region around the central 
black hole are governed by the black hole’s gravity. Well-established 
empirical correlations are then used to relate the observed luminosity 
of the optical lines to the size of the broad line region, while line widths are 
used to estimate the gas velocity in this region. From the size and velocity 
estimates the virial mass for the central body can be calculated’**'. We 
have also used reported’*** broad line luminosities as a proxy for the 
accretion disk luminosities, Lac. 

We calculate ®;., and i M for each source in our sample, assum- 
ing a. = 1.0, and plot our, results in Fig. 2. A partial Kendall correlation 
test between ®j, and Ee M, considering the common dependence of 
these two quantities on redshift and mass, confirms a positive correla- 
tion (over 30 significance), which is not an artefact caused by either 
distance- or mass-driven selection biases (see Methods and Extended 
Data Tables 3 and 4 for more details). The scatter in this correlation may 
be caused by deviations from our assumptions that a, = 1 and 7 = 0.4, 
as well as by observational errors. For example, most black hole spins 
are predicted” to fall in the range 0.3Sa, <1, and y may be different 
for each source, especially for low-luminosity AGN in our sample 
which are expected™* to have 70.4. Given that our sample is made 
up of diverse sources with different accretion histories, this correlation 
might seem surprising. However, magnetically arrested disk theory 
nicely explains this because it predicts a saturation value of magnetic 
flux that is mostly independent of initial conditions". 
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Figure 2 | Measured magnetic flux of the jet, H-,, versus Le? M. Here we assume 
that It 6; = 1; we also assume an accretion sadistic efficiency of 7 = 0.4 for our 
sample of 76 sources. The dashed line shows the theoretical prediction based on 
the magnetically arrested disk model. Filled and open circles represent blazars 
and radio galaxies, respectively (see Methods for details). 
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; 1/2 
We fit the equation Dj = ogy (sr2c) to the data by varying 


the dimensionless free parameter ¢py;, which can be interpreted as a 
dimensionless magnetic flux and is predicted to be ¢gy ~ 50 for mag- 
netically arrested disks. We find a best fit, py = (52 +5) 0;, where 
the error is the 1¢ confidence interval. The best fit value for dgy is 
expressed in terms of IO; because this value is not well known. AGN jet 
launching models” typically find 10; ~ 1, implying that most of the 
sources are close to the magnetically arrested disk prediction, py ~ 50, 
and their magnetic fields are dynamically important. However, we note 
that in the less luminous regions of jets that are well downstream of the 
bright radio core where our magnetic field measurements are taken, the 
typical value of I; is*® 0.1-0.2, which would give $y} ~ 10 for most 
sources. Even if this low value of I" 0; holds in the radio cores, it would 
still imply that a fraction of the sources, including the M87 galaxy, 
have dynamically important magnetic fields near their central black 
holes. Because our magnetic field measurements use emission located 
closer to the jet launching regions, we assume I; ~ 1 as predicted by jet 
launching models, and therefore ¢gy ~ 50 in agreement with magnet- 
ically arrested disk predictions. Thus, our data confirm the magnetically 
arrested disk model because it displays, as discussed above, a positive 
correlation between ®,., and ie M, and the data points are scattered 


around the theoretical curve defined by ®,.~50 (1 r 20), 


Our data provide direct observational evidence that the inner accre- 
tion disks of radio-loud AGN contain strong, dynamically important 
magnetic fields regulated by the mass accretion rate. As most models 
of black hole accretion disks rely on the assumption that the magnetic 
pressure in the disk body is much less than the plasma pressure, our 
findings imply that these models may require significant changes. In 
particular, attempts to model the silhouette of the central black hole in 
the M87 galaxy and the spectral energy distributions of X-ray binaries 
hosting strong radio jets may be in need of significant revision. Models 
of the Galactic Centre accretion disk may also need to be revised, as a 
dynamically important magnetic field has been reported® within a 
distance of ~3 X 10’r, from the central black hole. 


. 2 
The quantitative agreement between ®,,, and 50( M a c) demon- 


strates that our current theoretical models of magnetically arrested disks 
capture the most important processes in the accretion flow responsible 
for jet formation. This also implies that most, if not all, radio-loud objects 
contain dynamically important magnetic flux near their central black 
holes and that it is the magnetic fields twisted by the rotation of these 
black holes that power their jets (that is, the Blandford—Znajek mech- 
anism’). These twisted large-scale magnetic fields transfer energy (in the 
form of Poynting flux) from the rotating black holes out to parsec-scale 
distances, where their strength can be estimated by the core-shift effect. 
Our results are consistent with the proposal that radio-loud AGN con- 
sist of those black hole systems whose environment/accretion history 
is conducive to the formation of magnetically arrested disks, whereas 
radio-quiet AGN (that is, AGN without powerful jets) have failed 
to form magnetically arrested disks’. The idea that radio-quiet AGN 
are failed magnetically arrested disks is bolstered by the few studies 
of black hole spin made for radio-quiet AGN”*, which find close to 
maximally spinning black holes, implying that the Blandford—Znajek 
power (which presumably controls radio-loudness) must be low due 
to low magnetic flux threading their black holes. The importance of 
black hole feedback in part depends on jet power, which is typically 
assumed””*® to be ~0.1Mc?. However, our findings imply that much 
higher jet powers of ~Mc* are common’, hence suggesting that jets 
play a more important role in the process of AGN feedback than 
typically assumed. 


METHODS SUMMARY 

Our sample consists of two types of source: (1) blazars (68 sources), which are AGN 
with jets directed almost at Earth such that their emission is significantly boosted by 
relativistic effects, and (2) nearby radio galaxies (8 sources), which are AGN with 
jets that are typically more closely aligned with the plane of the sky. 
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To obtain Di for each source, we use published measurements of the core-shift, 
which is the angular distance that the radio core of each source shifts on the sky 
when observed at different frequencies using very long baseline interferometry 
(VLBI). We calculate the jet co-moving frame magnetic field strength one parsec 
downstream of the black hole, B),,., by using standard theoretical assumptions 
about the radio core’s emission and absorption mechanism (the synchrotron pro- 
cess), the ratio of magnetic energy to particle energy, and the flux freezing approxi- 
mation”. The results of analytical and numerical models of jets** are then used to 
calculate ®;., from Bi,,..- 

For each source, we calculate M from the L,., value derived from that source’s 
line luminosities (HB or Mg! or O 111), assuming that all the sources in our sample 
have the same radiative efficiency of 1 = 0.4. Although there are theoretical argu- 
ments that magnetically arrested disks could have slightly higher or lower y, our 
results are not particularly sensitive to the assumed value of 1 because the value of 
pep which we set out to measure, is proportional to ni! 2. 

The correlation we find between Pj, and ue M (in excess of 3c) accounts for all 
of our data including a number of upper limits. We quantify the scatter in our corre- 
lation by finding the two theoretical curves defined by #gy that encompass the 
middle 68% of our data, which gives @py = 24 and ¢gy = 93. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper 
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METHODS 


Data. Our sample of radio-loud AGN can be subdivided into two classes of objects: 
blazars and nearby radio galaxies. Extended Data Table 1 lists the properties of 68 
blazars in our sample, including several well studied ones (3C 454.3, 3C 279 and BL 
Lac)'*. We searched the literature and found optical line observations that we used 
to calculate the mass of the corresponding black holes and the bolometric lumin- 
osities of their accretion disks. 

Extended Data Table 2 lists the same properties as Extended Data Table 1, 

but for nearby radio galaxies that have core-shifts and accretion disk luminosities 
reported in the literature (we found only eight such cases). Core-shift measures are 
taken from different individual experiments: NGC 1052'°, 3C 120", M 81'*, W 
Comae"™, M 8717, Cen A’*, 3C 390.3" and Cyg A’. Owing to their proximity, six of 
these objects have black hole masses measured either via stellar velocity dispersion/ 
nuclear gas dynamics modelling (NGC 1052, M 81, M 87, Cen A, Cyg A) or rever- 
beration mapping (3C 120). 
Calculating the jet poloidal magnetic flux from the core-shift measure. According 
to the standard model of relativistic jets*', the observed position of the radio core 
(a bright, synchrotron self-absorbed emission feature at the upstream end of the 
jet) is a function of the observing frequency. This frequency-dependent shift in the 
location of the core can be used to estimate the magnetic field strength and electron 
number density'***. The method is based on several simple and reasonable as- 
sumptions: the flow has a conical shape with a small half-opening angle, 0;, 
and constant J” (no significant acceleration or deceleration). Furthermore, the 
magnetic field strength and electron number density are assumed to decrease with 
increasing distance from the central engine following power-law dependencies: 
B'(z) =B,,(z/Ipe)™, m,(zZ) = nip(z/1pe)™ where z is the distance from the 
black hole (see Fig. 1). Here Bi, and n},,, represent the jet frame magnetic field 
strength and electron number density at distance of one parsec away from the apex 
of the jet. These assumptions lead to a power-law behaviour’’ for the amount of 
core-shift as a function of the observing frequency, «v— 1/ky where k, is the core- 
shift index. We calculate the core-shift using’: 


1/k, 1 /k, 
vl" V> 


T/kr 1] kr 
V5. Vy 


Arcore,v, v2 D. 


Q,y =4.85 x 107? 
(1+z.) 


pe GHz (2) 


where Afcore,y,y,> Dy, and Z* are the observed shift (in milliarcsec) of the core 
position between frequencies v, and vy, (in GHz), the luminosity distance, and 
redshift of the source respectively (note we use z+ for redshift to distinguish it from 
the cylindrical coordinate z). 

If the magnetic field decays with distance as z_' as expected for an azimuthally 
dominated magnetic field, and the magnetic to non-thermal particle energy ratio 
remains constant, then it is expected that k,= 1. Numerous observations have 
indeed revealed a k, = 1 behaviour of the core-shift, thus supporting our assumption 
of a azimuthally dominated magnetic field in the radio core region'*’**. Assuming 
equipartition between magnetic and particle energy at the position of the radio core, 
then with the above assumptions for the scaling of the magnetic field and relativistic 
electron density with distance, one can estimate the jet frame magnetic field strength 
at 1 pce from the black hole: 

FQ? (1 +25 y 


Bi .~0.025 
pe 0,6? sin? 0 


(3) 


where 6 and 0 are the Doppler factor and the viewing angle of the jet, respectively. It 
is assumed that the source has a spectral index of « = —0.5. Here ¢;., is the ratio of 
the energy density in the magnetic field to the energy density in the non-thermal 
relativistic particles responsible for jet emission. We assume equipartition such that 
Oye = 1, though if this assumption is incorrect it would not strongly affect our results 
because the jet magnetic flux goes as Dje, oll}. 

For the estimate of the magnetic field we assume that (when a good estimate 
of the viewing angle is not available) jets of this sample of blazars are viewed close 
to their critical angle’, 0 ~ I~’. We compared a sub-sample of 32 sources with 


known Doppler factors® and viewing and intrinsic opening angles****”’ and found 
this is in fact a reasonable assumption for this sample. This yields'*: 
1/8 
3 1/2 
Bi p~0.042 Q7/4(1+Zx) / (1 + Bop) (4) 


where fp is the characteristic apparent speed of a jet’s components. 

We assume the mean jet frame magnetic field measured in the radio core is pre- 
dominantly azimuthal, as expected in magnetically powered relativistic jets*“°. As 
discussed above, this assumption is supported by the findings from core-shift studies 
that k, = 1, as well as other VLBI studies** of jets that are consistent with a B ox ie 
scaling, which is the expected scaling for a azimuthally dominated magnetic field in a 
conical jet. Theoretically, in Blandford-Znajek jets the azimuthal to poloidal field 
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ratio is By / Bp ~Rj /Ruc~a.R; / [(4—8)rg], which clearly is much greater than unity 
on VLBI scales’. This ratio was calculated for a rapidly spinning black hole (a+ ~ 1), 
and Ric = ¢/Qy is the radius of the light cylinder, where Q, is the angular frequency of 
magnetic field lines that remains constant along the jet (it is probably*“** between 1/4 
and 1/2 of the black hole angular frequency 24; = ca,/(2ry)). 

Let us consider a representative field line in the bulk of the jet, where most of the 
energy flows out and the observer frame magnetic field is strongest. In an idealized 
case of a highly magnetized jet, the poloidal magnetic field is approximately uni- 
form over the cross-section of the jet, and we have two familiar relationships: the 
observer frame azimuthal field scales as By oc R~ 1 and the poloidal field scales as 
By ocR;?, where R; is the jet cylindrical radius“. Realistic jets are, however, prob- 
ably not very highly magnetized: the process of jet acceleration requires conversion 
of Poynting flux to kinetic flux. In this conversion process, B, becomes non- 
uniform across the jet”*. Because of this, B, decreases faster than R-? and By 
decreases faster than R-! in the jet acceleration region. The poloidal magnetic field 
strength B, along the field line is related** to the total enclosed poloidal magnetic 
flux in the jet by je, = kn} Bp, where k=(1—JI"/)~' = 1isthe correction factor 
and J is the Lorentz factor of the field line. Here jx is the Lorentz factor that the field 
line would have if all of the Poynting flux were converted into kinetic energy flux. If 
we now eliminate B, in favour of B, using the relationship in the previous para- 
graph, we obtain a more general form for equation (1): 


2n(IO;) muzB, 


et la,(1—I'/) (5) 


where z is the distance down the jet, By, =By / T is the azimuthal magnetic field in 
the jet frame, and ¢ is defined such that Q: = €Qy;. In order to reproduce equation 
(1), we assume z = 1 pc, € = 1/2, = p/2, and B, ~Bipe because the jet co-moving 
frame magnetic field probed by core-shift measurements on parsec-scales, By... is 
likely to be azimuthally dominated. The assumption that [= 1/2 is justified by 
different analytical and numerical**“ studies of jet acceleration, which have found 
that at the end of the acceleration region roughly 50% of a jet’s Poynting flux is 
converted into kinetic energy flux. We convert between jet cylindrical radius Rj and 
jet downstream distance z by using a conical jet approximation such that Rj ~ jz. 
Predicted magnetic flux in a magnetically arrested disk. Under the assumption 
that the width of the optical broad lines emitted from the clouds surrounding black 
holes is mainly regulated by the gravitational potential of the black holes (assum- 
ing that these ionized clouds follow Keplerian orbits) and using a well-established 
empirical relation between the size of the broad line region and optical continuum 
luminosity, one can derive an order of magnitude estimate for black hole masses 


(for different line diagnostics)'?°**: 
wong | Eee 
worms)" Seno 

oe Lar ‘ ees "Mo (8) 


Here L, is the observed monochromatic continuum luminosity at wavelength A 
and FWHMhine is the observed full-width at half-maximum of the line in question. 
In radio-loud AGN, the optical emission can be contaminated by the non-thermal 
synchrotron emission of their jets, so we estimate the optical continuum on the 
basis of the broad emission line luminosities”’. 

Furthermore, we have used broad (and in two cases narrow) line luminosities 
as a surrogate for the thermal (non-beamed) emission from the accretion disk. 


The scaling relations between Lacc and the line luminosity, Line, for different lines 
are 19,22,51,52, 


log, Lace = (12.32 + 0.32) + (0.78+0.01)log,y Lup (9) 


logy Lace = (16.76 + 0.26) + (0.68 + 0.01)log,y Laigi (10) 


(11) 


We may now calculate the predicted magnetically arrested disk magnetic flux for a 
source with known M and L,..: 


*6,2,\ 1? af uj—?| M 
50 (Mr c) =2.4x 10 
g 0.4 10 Me 


log, Lace = (26.50 + 0.32) + (0.46+0.01)log, Lom 


L 1/2 


2 
1.26 x 10*’erg s~! ee 2) 


To calculate M from L,.., we assume that all of the sources in our sample have the 
same radiative efficiency y. If the gravitational binding energy of accreting gas is 
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radiated away by the time it reaches the black hole’s innermost stable orbit, then 
for a maximally spinning black hole*’ 4 ~ 0.42. Magnetically arrested disks may 
behave differently from standard disks, with 7 ~ 0.5 for non-rotating black holes 
and potentially higher 1 for rotating black holes®. The radiative efficiency could be 
even higher if inverse Compton scattering by the hot, magnetized disk corona 
reprocesses disk emission. On the other hand, if a fraction of liberated binding 
energy is carried away mechanically (for example, by magnetized winds) instead of 
by radiation, the resulting 7 values could be lower (since the efficiency of magnetized 
winds in the magnetically arrested disk model is’ 7 = 0.2, we expect this to be at most 
amodest correction). Note that since the value of #1;, which we set out to measure, is 
proportional to 7”, our results are not particularly sensitive to the assumed value of 
yn. In this work, we adopt a representative value, 7 = 0.4. 

Linear regression and partial correlation analysis. In order to probe for possible 
correlations between observed values (for example, accretion luminosity and mag- 
netic field strength), one should take into account their possible common depend- 
encies on other parameters (for example, distance, mass and relativistic boosting). 
We have done this by using a generalized version of the partial Kendall’s tx correla- 
tion test™* that is suitable for our sample (which includes a number of upper limits). 
We used the publicly available FORTRAN code* which has been widely used for 
probing correlations in astronomical data sets. Extended Data Table 3 summarizes 
the results of our partial correlation analysis. 

Furthermore, we have double-checked our results with the ppcorr package** which 
is available as a public library for R°*’. The ppcorr package calculates the Kendall tx 
correlation between different observed values while controlling for dependencies on 
more than one commonly dependent variable. However, ppcorr does not take upper 
and lower limits (so-called censored data points) into account properly. Extended 
Data Table 4 summarizes the results of a test using ppcorr for a correlation between 
Der and wi? M while simultaneously controlling for common dependencies on mass, 
redshift and apparent velocity. 

The scatter in the observed correlation can be quantified in terms of the free 
parameter defining the theoretically predicted curves, pgy. If we search for the two 
theoretical curves that encompass the middle 68% of the data points (that is, 16% 
of the data lie above the upper line and 16% of the data lie below the lower line), 
then we find the curves defined by ¢gy = 24 and dgy = 93 satisfy this criterion 
(recall dp ~ 50 is the value predicted by magnetically arrested disk theory). Note 
that when we state the results of the fit, py = (52 + 5), the error represents the 
confidence interval of the best fit value, and does not refer to the scatter discussed 
above. The best fit value and its confidence interval are obtained using the MPFIT 
package**””. 

We also performed linear regression between different observed values (for ex- 
ample, Dye and qi? M) using a Bayesian method that accurately accounts for 
the censored data points®. For this purpose we have used the LINMIX_ERR 
routine which is publicly available as a part of the astrolib package®'. We have 
used the Metropolis-Hastings algorithm available in the code to obtain the pos- 
terior distribution. The linear regression results in a best-fit model log Pie, = 
(8.52 + 1.06) + (0.79 +0.03) x log Lal2M (1o confidence). 
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Object Type Bie Bapp (C) = Qrv (pe GHz) Bias (G) LogioM (Mo) LogioLacc (ergs~') Reference 
(1) (2) (3) (4) (5) (6) (7) (8) (9) 
0133+476 Q 0.86 12.98 13.69 0.77 8.55 46.11 Tor12 
0212+735 Q 2.37 7.64 21.24 1.27 9.96 47.33 Tor12 
0234+285 Q 1.21 12.26 35.82 1.71 9.22 46.80 Shw12 
0333+321 Q 1.26 12.76 41.51 1.95 9.25 47.17 Liu06 
0336—019 Q 0.85 22.36 14.41 0.92 8.89 46.33 Liu06 
0403—132 Q 0.57 19.69 33.30 1.54 9.08 46.52 Liu06 
0420—014 Q 0.91 7.36 35.94 1.44 8.41 46.48 Liu06 
04544844 B 0.11 0.14 10.91 0.27 7.42 44.66 Tor12 
0528+134 Q 2.07 19.20 22.71 1.60 9.03 47.23 Pal11 
0605—085 Q 0.87 19.79 13.24 0.84 8.87 46.44 Shw12 
0607—157 Q 0.32 3.93 21.22 0.68 7.32 45.20 Liu06 
0736+017 Q 0.19 14.32 <2.94 <0.20 7.86 45.69 Liu06 
0738+313 Q 0.63 10.76 16.85 0.81 9.57 46.93 Liu06 
0748+126 Q 0.89 18.37 13.49 0.84 9.06 46.88 Tor12 
0804+499 Q 1.44 1.83 11.15 0.48 9.39 46.80 Liu06 
0827+243 Q 0.94 22.01 19.64 1.18 8.89 46.46 Tor12 
0836+-710 Q 2.22 25.38 25.72 1.93 9.36 47.51 Liu06 
0859—140 Q 1.34 16.47 30.96 1.70 9.14 46.73 Tor12 
0906+-015 Q 1.02 20.68 29.45 1.61 8.55 46.61 Liu06 
0917+624 Q 1.45 15.57 16.99 1.09 8.93 46.57 Shw12 
0923+392 Q 0.69 4.29 <6.64 <0.33 9.09 46.94 Liu06 
0945+408 Q 1.25 18.60 17.02 1.10 8.99 46.46 Tor12 
0953-+-254 Q 0.71 11.52 <6.71 <0.42 8.70 46.30 Liu06 
1015+359 Q 1.23 12.46 15.55 0.92 8.69 46.65 Tor12 
1038-+064 Q 1.26 11.87 21.97 1.19 9.12 46.76 Tor12 
1127-145 Q 1.18 14.18 13.24 0.84 9.30 46.87 Tor12 
1156+295 Q 0.73 24.73 20.11 AZ 8.54 46.25 Liu06 
1219+044 Q 0.96 2.35 24.16 0.81 8.89 46.48 Tor12 
1222+216 Q 0.43 21.10 17.03 0.90 8.87 46.34 Tor12 
1253—055 Q 0.54 20.57 <5.88 <0.42 8.28 46.05 Liu06 
1302—102 Q 0.28 5.41 20.34 0.70 8.51 46.26 Liu06 
1308+326 Q 1.00 27.17 13.61 0.96 8.72 46.38 Tor12 
1334—127 Q 0.54 10.26 31.08 1.23 7.98 45.98 Liu06 
1458+718 Q 0.90 7.04 9.46 0.51 9.23 46.80 Liu06 
1502+106 Q 1.84 14.77 8.50 0.69 8.74 46.92 Liu06 
1508—055 Q 1.19 18.64 26.81 1.52 9.32 46.71 Tor12 
1510—089 Q 0.36 20.14 13.50 0.73 8.20 46.05 Liu06 
1546+027 Q 0.41 12.08 <5.09 <0.32 8.47 46.32 Liu06 
1606+106 Q 1.23 18.91 10.97 0.79 8.97 46.56 Tor12 
1611+343 Q 1.40 14.11 9.07 0.66 9.19 46.78 Tor12 
1633+382 Q 1.81 29.45 21.21 1.62 9.12 46.80 Tor12 
1637+-574 Q 0.75 10.61 13.51 0.71 9.22 46.77 Liu06 
1641+399 Q 0.59 19.27 23.85 1.20 9.27 46.69 Liu06 
1642+690 Q 0.75 16.65 <6.85 <0.48 6.81 45.17 Tor12 
1655-++-077 Q 0.62 14.45 7.45 0.47 7.28 45.60 Liu06 
1726+455 Q 0.72 1.82 <6.73 <0.28 9.18 46.63 Tor12 
1749+701 B 0.77 6.03 27.03 1.04 8.77 45.96 Tor12 
1803+784 B 0.68 8.97 <6.58 <0.39 7.92 45.98 Liu06 
1807+698 B 0.05 0.10 3.81 0.12 8.51 44.98 Tor12 
1823+568 B 0.66 20.85 11.79 0.74 7.94 45.61 Tor12 
1828+487 Q 0.69 13.65 12.24 0.69 8.41 46.34 Tor12 
1849-+670 Q 0.66 30.63 6.48 0.52 8.81 46.22 Tor12 
19014319 Q 0.63 2.67 29.03 0.87 7.72 46.08 Tor12 
1928+738 Q 0.30 8.43 12.31 0.54 8.35 46.47 Liu06 
2121+053 Q 1.94 13.29 22.61 1.43 8.78 47.15 Liu06 
2128—123 Q 0.50 6.94 26.41 0.98 9.02 46.65 Liu06 
2134+004 Q 1.93 5.94 26.23 1.31 9.36 46.93 Tor12 
2145+067 Q 0.99 2.52 <7.48 <0.34 8.87 47.07 Liu06 
2155—152 Q 0.67 18.11 43.23 1.89 7.59 45.67 Woo02 
2200+420 B 0.07 10.57 1.21 0.09 8.23 45.18 Tor12 
2201+315 Q 0.29 7.87 27.04 0.95 8.94 46.68 Liu06 
2209+236 Q 1.12 3.43 7.69 0.39 8.46 46.39 Shw12 
2227 —088 Q 1.56 8.14 29.60 1.44 8.98 46.58 Tor12 
2230+114 Q 1.04 15.41 46.48 2.12 8.93 46.58 Tor12 
2243—123 Q 0.63 5.49 20.10 0.78 8.81 46.64 Tor12 
2251+158 Q 0.86 14.19 22.00 0.32 8.69 46.86 Liu06 
2345—167 Q 0.58 13.45 18.44 0.90 8.47 45.84 Liu06 
2351+456 Q 1.99 18.01 25.97 v2 9.29 46.68 Tor12 


(1), IAU name (B1950); (2), classification of the source (Q, quasar; B, BL Lac object; RG, radio galaxy); (3), redshift; (4), apparent velocity of the jet; (5), core-shift measure; (6), magnetic field strength one parsec 
away from the jet base; (7), mass of the black hole; (8), accretion disk luminosity; (9), reference for the line used for estimating mass and accretion luminosity. References: Liu06, ref. 21; Tor12, ref.62; Shw12, ref. 50; 


Pal11, ref. 63; Woo02, ref. 20. 
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Extended Data Table 2 | Radio galaxy sample 


Object Type ae Bapp (C) = Qrv (pe GHz) Bine (G) LogioM (Me) —Logio Lace (erg s~*) Reference 
(1) (2) (3) (4) (5) (6) (7) (8) (9) 
0238—084 (NGC 1052) RG 0.0050 0.23 0.226 0.015 8.19 41.24 Woo002/Gon09 
0430+052 (3C 120) RG 0.033 4.66 1.85 0.11 7.36 43.92 King13 
0951+693 (M 81) RG 0.0001 1.45 0.031 0.0066 7.80 39.77 Mar08 
1219+285 (W Coma) RG 0.102 4.05 1.39 0.08 8.69 43.25 Ghe10 
1228+126 (M 87) RG 0.004 0.24 0.132 0.012 9.82 40.85 Geb11/DiM03 
1322—427 (Cen A) RG 0.0018 0.50 0.155 0.017 7.74 41.68 Neu09/Eva04 
18454797 (3C 390.3) RG 0.057 2.29 1.65 0.089 8.83 44.61 Tor12 
1957+405 (Cyg A) RG 0.0562 0.20 2.10 0.07 9.43 44.57 McN11/Kino05 
Similar to Extended Data Table 1. References: DiMO3, ref.64; Eva04, ref.65; Geb11, ref. 66; Ghe10, ref.67; GonO9, ref. 68; King13, ref.69; Mar08, ref. 70; McN11, ref. 71; NeuO9, ref. 72; KinoO5, ref. 73; Tor12, ref. 62; 


Woo02, ref. 20. 
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Extended Data Table 3 | Partial correlation analysis results based on cens_tau algorithm 


Variables Correlation 
Xi X2 X3 TK Poon Significance of rejecting the null hypothesis 
All sources: 
Log L1/?M Log ®jct Ze 062 <10-% >5o0 
Log L3/?M Log®;- LogM 046 11x 10% 3.30 
Log L1/?M Log ®jc,  Bapp 0.72 <107-'4 >5o 


Excluding LLAGN: 


Log £1/2M Log ®jet Zn 0.61 210-4 >5o 
Log L1/?M Log ®j. LogM 0.21 1.2x 1073 3.20 
Log £1/2M Log ®jet Bapp 0.70 <i10-“ >5o0 


X, and X2 are variables that have a possible mutual dependency on variable X3. tx and Pru are the Kendall’s partial correlation coefficient and probability for the null hypothesis of no correlation between X; and Xz, 
respectively. The last column shows the corresponding significance by which the null hypothesis could be rejected. 
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Extended Data Table 4 


Results of partial correlation analysis based on ppcorr algorithm 


Excluding upper limits: 


Excluding upper limits 


and LLAGN: 


Variables Correlation 
Xo Xz Xa X5 TK Pru 


Log ®je, 2.  LogM  fBapp 0.52 1.9x 10° 


Log®j z  LogM Bapp 0.32 5.2x 1073 


Significance of rejecting the null hypothesis 


3.00 


X; and Xz are variables that have a possible mutual dependency on variables X3, Xq4 and Xs. tk and Pu are the same as in Extended Data Table 3. 
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The development of new chemical transformations based on cata- 
lytic functionalization of unactivated C—H bonds has the potential 
to simplify the synthesis of complex molecules dramatically. Tran- 
sition metal catalysis has emerged as a powerful tool with which to 
convert these unreactive bonds into carbon—carbon and carbon— 
heteroatom bonds’, but the selective transformation of aliphatic 
C—H bonds is still a challenge. The most successful approaches involve 
a ‘directing group’, which positions the metal catalyst near a parti- 
cular C—H bond, so that the C—H functionalization step occurs via 
cyclometallation’. Most directed aliphatic C—H activation processes 
proceed through a five-membered-ring cyclometallated intermediate*””. 
Considering the number of new reactions that have arisen from such 
intermediates, it seems likely that identification of distinct cyclo- 
metallation pathways would lead to the development of other useful 
chemical transformations". Here we report a palladium-catalysed 
C—H bond activation mode that proceeds through a four-membered- 
ring cyclopalladation pathway. The chemistry described here leads 
to the selective transformation of a methyl group that is adjacent to 
an unprotected secondary amine into a synthetically versatile nitro- 
gen heterocycle. The scope of this previously unknown bond discon- 
nection is highlighted through the development of C—H amination 
and carbonylation processes, leading to the synthesis of aziridines 
and f-lactams (respectively), and is suggestive of a generic C— H fun- 
ctionalization platform that could simplify the synthesis of aliphatic 
secondary amines, a class of small molecules that are particularly im- 
portant features of many pharmaceutical agents. 

Several classes of directing group can participate in directed C-H acti- 
vation of aliphatic systems and include functionalities such as carbonyl 
derivatives, aromatic nitrogen heterocycles and hydroxyl motifs'*"*. Pal- 
ladium salts have been particularly successful at effecting cyclometalla- 
tion with these functional groups; coordination of the palladium centre 
to the Lewis basic heteroatom within the directing motifs is assumed to 
lower the entropic and enthalpic costs of the C-H bond cleavage and ring 


closure, and accounts for why kinetically favoured five-membered-ring 
intermediates are preferred over other cyclopalladation pathways. Reac- 
tion of the resulting carbon-palladium bond with an external reagent 
leads to an overall chemical transformation that sees a carbon—hydrogen 
bond converted into a synthetically versatile functional group. As a 
result, this type of cyclopalladation event has led to a number of useful 
catalytic C-H bond functionalization processes that have expanded the 
toolbox of reactions available to synthetic chemists (Fig. 1a). Despite 
their success, one limitation of catalytic cyclopalladation is that the exqui- 
site selectivity displayed through five-membered-ring intermediates 
means that the generic strategic bond disconnection enabled by direc- 
ted C-H activation is often restricted to a single position with respect 
to the controlling functional group. As part of an overarching goal to 
develop new activation modes for catalytic C-H activation, we ques- 
tioned whether cyclopalladation could function in systems that do not 
possess C-H bonds in the positions that would facilitate classical, kinet- 
ically favoured insertion processes. Furthermore, we reasoned that C-H 
activation directed by common functional groups represents an impor- 
tant challenge to the continued advance of this field; whereas native 
functionalities such as carboxylic acids and hydroxyl groups have been 
shown to participate in catalytic cyclopalladallation’”"*, there are no 
examples of directed catalytic C-H activation with unprotected aliphatic 
amines’’**. Here we report the successful realization of these ideas via a 
new C-H activation strategy which uses an unprotected aliphatic sec- 
ondary amine to direct a palladium-catalysed C-H activation event that 
transforms an adjacent methyl group into a synthetically versatile nitrogen 
heterocycle (Fig. 1b). This novel palladium-catalysed activation process 
proceeds through a remarkable four-membered-ring cyclometallation 
pathway and enables a broad scope for catalytic C-H functionalization 
through both Pd(m) or Pd(1v) intermediates. 

At the outset of our studies, we selected an amine substrate display- 
ing a series of C-H bonds positioned in such a way that they could not 
engage in classical five-membered-ring cyclopalladation (Fig. 2a). When 
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Figure 1 | Palladium-catalysed directed C-H activation modes. a, Classical 
directed C-H activation via a kinetically favoured five-membered-ring 
cyclopalladation complex. X, directing group; FG, new functional group. 

b, New directed C-H activation of an aliphatic secondary amine, via a 


cyclopalladation complex 


aziridine product 


four-membered-ring cyclopalladation complex and novel reactivity in the 
presence of an oxidant, to form synthetically versatile strained nitrogen 
heterocycles, such as aziridines. R'3, general aliphatic substituents. 
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LETTER 


the aliphatic secondary amine 1a was treated with a stoichiometric amount 
of palladium(1) acetate, we observed a new trinuclear organopalladium 
complex (int-I) where C-H activation had taken place at one of the 
unactivated methyl groups proximal to the amine and formed a four- 
membered-ring intermediate (as determined by single-crystal X-ray 
diffraction of a phosphine derivative, 2a). To the best of our knowledge, 
this is the first example of an amine-directed four-membered-ring cyc- 
lopalladation event”*”’. We next probed the reactivity of these strained 
cyclopalladation complexes; we questioned whether mild chemical oxi- 
dants, such as the hypervalent iodine compound PhI(OAc),, might allow 
access to C-H functionalization pathways proceeding through high- 
valent Pd(1v) cyclometallated intermediates”, as well as formulating the 
basis of a catalytic transformation. When we treated a cyclopalladation 
complex (int-I or 2a) with PhI(OAc), we were surprised to discover aziri- 
dine 3a, presumably formed via C-N bond forming reductive elimina- 
tion from a high oxidation state Pd(Iv) intermediate (int-IIT, Fig. 2b, c). 
We established a catalytic transformation based on this unusual C-H 
functionalization event; after optimization, the Pd-catalysed C-H pro- 
cess requires only 5 mol% catalyst to produce aziridine 3a on a multi- 
gram scale. Furthermore, the reaction is also selective between the two 
different types of methyl groups (red and blue in Fig. 1a), indicating a 
subtle stereoelectronic differentiation imparted by the nearby carbonyl 
group. The distinctive nature of the directed cyclopalladation pathway 
was further evidenced when we tested amine 1b, where cyclopallada- 
tion can proceed through a four-membered-ring (activation on methyl 
group) or the conventional five-membered-ring pathway (activation 
on ethyl group) (Fig. 2d). Remarkably, we still observe activation through 
the four-membered-ring pathway forming the related strained com- 
plexes (int-II, 2b)—we do not believe that such selectivity has been 
observed before. We also found that the amine 1b underwent the same 
catalytic transformation in very good yield, forming the correspond- 
ing aziridine 3b (Fig. 2e). Not only does this catalytic cyclopalladation 
pathway contrast with classical directed C-H activation, it also reveals 
novel chemical reactivity that underpins a conceptually distinct discon- 
nection to synthesize strained nitrogen heterocycles directly from simple 
aliphatic amines displaying methyl groups adjacent to an unprotected 
nitrogen motif. 
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Figure 2 | A new amine-directed, palladium-catalysed directed C-H 
activation mode. a, Coordination of amine 1a to the Pd(11) centre facilitates a 
cyclopalladation to give a four-membered-ring complex int-I. Proof of 
structure is obtained through X-ray diffraction of a phosphine derivative, 

2a. b, e, Palladium-catalysed C-H aziridination of secondary amines (1a in 
panel b, 1b in panel e). Reaction to give respective aziridines 3a and 3b most 
probably proceeds through the oxidation of int-I to the Pd(1v) intermediate 
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In assessing the scope of the catalytic reaction, a range of amines 
displaying a variety of substituents were found to be suitable substrates 
(Fig. 3). For example, simple alkyl-(methyl) substituted morpholinones 
undergo the C-H aziridination process in high yield (3a—e) and are 
always selective for the four-membered-ring C-H activation pathway. 
Substituents containing aromatic rings, protected hydroxyl groups, sen- 
sitive chloromethyl and fluoromethyl motifs, esters and protected amines 
all work well, indicating a broad tolerance of the process to a variety of 
useful functional groups (3f-k). The C-H aziridination reaction can 
be applied to methylene C-H bonds, albeit in lower yield, to form 31, 
further expanding the potential efficacy of this process. Morpholinones 
that display only three substituents around the secondary amine motif 
have so far not been found to react under the standard conditions. Sub- 
strates containing substituents other than methyl groups distal to the 
carbonyl motif can be accommodated; a piperidine substituted morpho- 
linone undergoes smooth C-H aziridination to form a product (3n) 
displaying a high density of useful functionality. Interestingly, we found 
that amine derivatives not possessing the lactone framework also under- 
went catalytic C-H activation through the four-membered-ring cyclopal- 
ladation pathway in the presence of PhI(OAc), but formed acetoxylation 
product (5) rather than an aziridine. This again suggests that the carbonyl 
group in the morpholinones may play a subtle controlling role, this 
time steering the reactivity of the putative organo-Pd(Iv) intermediate. 
Importantly, acyclic amines also undergo directed C-H activation to 
form an amino alcohol derivative (6) is reasonable yield. 

One of the most important uses of aziridines is their ring opening 
reactions with nucleophiles”’. Despite the non-activated and hindered 
nature of these aziridines (3), we found that a range of nucleophiles can 
open this strained heterocycle to form complex amine products (Fig. 3c). 
For example, heteroatom nucleophiles such as azide, thiol, chloride and 
water open the aziridine in good yield under acidic conditions (4a-d). 
Notably, fluoride can also be used as a nucleophile, forming amine 4e. 
The aziridine motif can also be opened with a carbon nucleophile (N- 
methylindole) to form 4f. To further demonstrate the synthetic ver- 
satility of these products, the functionalized aziridine 3g can also be 
opened with pyrazole followed by saponification-oxidative cleavage of 
the cyclic framework, revealing a highly functionalized quaternary amino 
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int-III (c) before a reductive elimination to form a C-N bond. L, undefined 
neutral ligand. d, The novel cyclopalladation pathway prevails when a classical, 
kinetically favoured five-membered-ring cyclopalladation (int-II) is possible; 
the structure of int-II was determined through X-ray diffraction of a phosphine 
derivative, 2b. For X-ray structures, grey is C, red is O, dark blue is N, 
aquamarine is Pd, orange is P, green is Cl, and light blue is F. 
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Figure 3 | Scope of the palladium-catalysed C-H aziridination process. 

a, Transformation of aminolactone (1) to aziridine (3) and then ring opening 
with nucleophiles (Nu-H) gives complex amine 4. b, Scope of aziridination 
reaction. For 5, mass balance comprises a complex mixture of polyacetoxylated 
products. c, Scope of aziridine ring opening (see Supplementary Information 


acid 7a, characteristic of a class of molecule that would be difficult to 
make by conventional synthetic methods (Fig. 3d). Moreover, the C-H 
activation strategy also provides access to a previously inaccessible class of 
fully substituted secondary aliphatic amines. The lactone motif in mor- 
pholinones such as 4f can be transformed into useful functionality (7b, c) 
that is representative of a class of complex fully substituted aliphatic 
secondary amines with unexplored biological properties; the limited 
number of ways through which this class of sterically hindered amine 
can be synthesized means they have therefore been largely ignored as 
pharmaceutical candidates. However, we believe that the important nature 
of aliphatic nitrogen-containing compounds in medical chemistry will 
generate significant interest in this elusive class of amines. 

We next explored the reactivity of these molecules under a comple- 
mentary palladium manifold. Replacing PhI(OAc), with carbon mon- 
oxide and a suitable oxidant would lead to a potential carbonylation 
process through a Pd(11)-Pd(0) catalytic cycle*. We found that mixing 
tetramethylpiperidine (TMP) with stoichiometric Pd(OAc), gave rise 
to a four-membered-ring palladacycle (8) that could be directly char- 
acterized by X-ray diffraction (Fig. 4a). Furthermore, when we treated 
this complex with carbon monoxide, a B-lactam (9a) was formed in high 
yield (Fig. 4b). Here again, a strained nitrogen heterocycle can be formed. 
from a methyl group adjacent to a secondary amine, and this further 
underlines the synthetic versatility of this new directed C-H activation 
mode. Assessment of a range of reaction parameters led to an opti- 
mized catalytic process that required treatment of TMP with 10 mol% of 
Pd(OAc)z, and 10 mol% of Cu(OAc), as oxidant, in toluene at 120 °C 
under a CO/air mixture (at atmospheric pressure for 22h) to form 
B-lactam 9a in 87% yield. As shown in Fig. 1b, an array of commercial 


57% 98% 


4f 7c 


for experimental procedures). d, Synthesis of a functionally complex 
quaternary amino acid (7a) from 3g. e, Representative transformations of the 
aziridine ring opening products (4f) into highly functionalized fully substituted 
secondary aliphatic amines (7b, c). 


or readily available piperidine derivatives are competent substrates. Not 
only does this reaction produce a series of novel f-lactams that may 
themselves have interesting biological properties, but many of the pro- 
ducts also display orthogonal and versatile functionality that can be 
easily transformed into more complex amines. In addition to standard 
piperidine derivatives (9a, b), a number of other heterocycles could also 
be readily tolerated: morpholine and piperizine scaffolds, fluorinated 
piperidines and a seven-membered-ring azepine all productively form 
the corresponding B-lactams and display the type of structural motifs 
that are frequently considered important in the design of pharmaceu- 
tical agents (9c-f). The morpholinone 1a also undergoes carbonylation, 
but, in contrast to the C-H aziridination process, a 2:1 mixture of sepa- 
rable B-lactams, in favour of 9g, is formed. This result is surprising given 
the selectivity of the cyclopalladation with this substrate and suggests 
that the C-H palladation step may be reversible under carbonylation 
conditions. Acyclic amines are also effective substrates, producing the 
monocyclic B-lactam 9h in good yield. 

Given the broad utility and application of aliphatic amines, it is sur- 
prising that they are seldom employed in C-H bond functionalization 
reactions, especially considering that secondary amines are excellent 
coordinating groups for palladium salts. When treated with palladium(1) 
salts, most amines rapidly form square-planar, coordinately saturated 
bis-amine palladium(11) complexes, but the pathway for C-H bond func- 
tionalization requires liberation of a coordination site occupied by one 
of these amines”’. As the palladium(11) complexes formed from these 
amines are typically very stable, there is little driving force for the release 
of an amine ligand. This generally renders the bis-amine palladium species 
catalytically inactive unless they are subjected to thermal activation that 
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Figure 4 | Palladium-catalysed C-H carbonylation of aliphatic amines. 
a, Similarly to amine 1a, treatment of TMP with Pd(OAc), generates the 
four-membered ring cyclopalladation complex 8, whose structure was directly 
determined by X-ray diffraction. b, Scope of the C-H carbonylation reaction. 


can also lead to unwanted side reactions. The hindered secondary amines 
used in this study, which contain two fully substituted carbon atoms 
bonded to nitrogen, also form bis-amine complexes with Pd(OAc), (10, 
Fig. 5). We propose that the steric hindrance around the Pd(u) centre 
results in more facile dissociation. This weaker binding would therefore 
facilitate the release of one of the amines to create the essential vacant 
coordination site (int-IV) and enable C-H activation to take place”. 
We speculated that if secondary amines that do not possess two fully 
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Top row, reaction of 1 to give 9; middle and bottom rows, range of products 
and yields. For 9g, a 2:1 mixture of regioisomers was observed in favour of 
the one shown. For 9h, the reaction was conducted with AgOAc in dioxane 
at 125 °C. 


substituted carbon groups directly attached to the NH motif could be 
accommodated (as represented by amine 11), then the efficacy of this 
new C-H activation mode would be significantly expanded. Although 
we were conscious such amines might be prone to competitive decom- 
position pathways potentially via B-hydride elimination, we found that 
these less hindered amines, composed ofa fully substituted carbon atom 
on one side of the NH motif and a tertiary carbon atom on the other, 
undergo the palladium-catalysed C-H carbonylation reaction to form 
the corresponding B-lactams 12a-c (Fig. 5b). The C-H activation path- 
way was even viable on amines displaying two tertiary substituents (hence 
a C-H bond adjacent to nitrogen on both substituents), and although 
the yield to form 12d is low, this initial result suggests that the generic 
activation pathway could be possible on many classes of secondary amine. 
Finally, we were also able to demonstrate that more classical five- 
membered-ring cyclopalladation pathways can also be initiated by this 
type of secondary aliphatic amine. On treatment of ethyl-substituted 
morpholinone 30 with Pd(OAc), and PhI(OAc), as oxidant, we observed 
a reasonable yield (55%) of the C-H acetoxylation product 13 that is 
presumably derived from the five-membered-ring palladacycle 14. Based 
on the extensive number of transformations available to related five- 
membered-ring cyclopalladation pathways, this result suggests that ali- 
phatic amines could be amenable to a range of new reactions through 
this well established activation mode. 

Taken together, the present results represent an important initial 
advance that could substantially broaden this directed C-H activation 
methodology to further classes of secondary amine, and further rein- 
forces the potential of a new strategic bond disconnection which con- 
verts simple methyl groups into a range of versatile strained nitrogen 
heterocycles. As the physical and biological properties of aliphatic amines 
are central to the function of many important pharmaceuticals, chemical 
reagents and polymeric materials, we expect that such an approach to 
amine synthesis will be broadly useful in the synthesis of complex func- 
tional amines. 


Figure 5 | Towards a general strategy for C-H activation of aliphatic 
amines. a, Amines bind strongly to palladium(1) salts to form stable bis-amine 
complexes (10). Dissociation of one of the amine ligands leads to C-H 
activation and then (via int-IV) to the cyclopalladation complex 8. The steric 
properties of these hindered amines are likely to be responsible for the facile 
dissociation; note that the Pd(11) centre is projected directly into a high 
concentration of aliphatic C-H bonds. b, Top row, less hindered amines 

(11) can also undergo the directed C-H activation through the four-membered 
ring; bottom row, scope of products (12a-d). c, Demonstration that C-H 
activation of unprotected aliphatic secondary amines (such as 30) can also 
proceed through five-membered-ring cyclopalladation pathways (14) to give 
compounds such as 13. 


©2014 Macmillan Publishers Limited. All rights reserved 


METHODS SUMMARY 


Pd(i1)-catalysed C-H amination to aziridines. An oven-dried round-bottom 
flask, equipped with stir bar, was charged with the amine (1.0 equiv.), Pd(OAc), 
(0.05 equiv.) and iodobenzene diacetate (1.5-2.5 equiv.). The solvent (0.1-0.05 M) 
was added, followed by acetic anhydride (2.0 equiv.), and the flask capped under 
an air atmosphere. The flask was placed ina pre-heated oil bath at 70 °C and stirred 
for the given time. The reaction mixture was then cooled to room temperature, filtered 
through Celite, eluted with ethyl acetate, and concentrated in vacuo. The residue 
was purified by flash column chromatography to provide the pure aziridine product. 
Pd(11)-catalysed C-H carbonylation to B-lactams. An oven-dried round-bottom 
flask, equipped with stir bar, was charged with Pd(OAc) (0.1 equiv.), Cu(OAc)2 
(0.1 equiv.) and the amine (as a 0.1 M solution in toluene), and then fitted with a 
reflux condenser and rubber septa. A balloon of air and a balloon filled with carbon 
monoxide were fitted and then the flask was placed in a pre-heated oil bath at 
120°C such that the oil level matched the level of solvent in the flask and stirred for 
22-24h. The reaction mixture was then cooled to room temperature and filtered 
through Celite, eluting with ethyl acetate. The filtrates were washed with a 0.1M 
solution of hydrochloric acid (20 ml), a saturated aqueous solution of NaHCO; 
(20 ml) and brine (20 ml). The organic phase was dried (MgSO,), filtered and con- 
centrated in vacuo. The residue was then purified by flash column chromatography 
to provide the pure B-lactam product. 
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Millennial-scale variability in Antarctic ice-sheet 
discharge during the last deglaciation 
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Our understanding of the deglacial evolution of the Antarctic Ice 
Sheet (AIS) following the Last Glacial Maximum (26,000-19,000 years 
ago)’ is based largely on a few well-dated but temporally and geo- 
graphically restricted terrestrial and shallow-marine sequences” *. 
This sparseness limits our understanding of the dominant feedbacks 
between the AIS, Southern Hemisphere climate and global sea level. 
Marine records of iceberg-rafted debris (IBRD) provide a nearly con- 
tinuous signal of ice-sheet dynamics and variability. IBRD records from 
the North Atlantic Ocean have been widely used to reconstruct vari- 
ability in Northern Hemisphere ice sheets’, but comparable records 
from the Southern Ocean of the AIS are lacking because of the low 
resolution and large dating uncertainties in existing sediment cores. 
Here we present two well-dated, high-resolution IBRD records that 
capture a spatially integrated signal of AIS variability during the last 
deglaciation. We document eight events of increased iceberg flux from 
various parts of the AIS between 20,000 and 9,000 years ago, in marked 
contrast to previous scenarios which identified the main AIS retreat 
as occurring after meltwater pulse 1A*** and continuing into the 
late Holocene epoch. The highest IBRD flux occurred 14,600 years 


ago, providing the first direct evidence for an Antarctic contribution 
to meltwater pulse 1A. Climate model simulations with AIS fresh- 
water forcing identify a positive feedback between poleward trans- 
port of Circumpolar Deep Water, subsurface warming and AIS melt, 
suggesting that small perturbations to the ice sheet can be substantially 
enhanced, providing a possible mechanism for rapid sea-level rise. 

Today, the estimated total iceberg calving flux from Antarctica is about 
1,300-2,000 gigatons per year (Gt yr_)°, with giant (longer than 18 km) 
icebergs representing at least half of the total AIS mass loss’°. Iceberg tra- 
jectory studies for the sea-ice-covered part of the Southern Ocean show 
that only a small fraction of icebergs calving from the AIS escapes coastal 
regions directly to the north, owing to topographic steering of ocean 
currents”. Otherwise, a substantial fraction of large icebergs’ remain 
entrained in the westward-flowing Antarctic Coastal Current, travel- 
ling counter-clockwise around Antarctica before entering the Weddell 
Sea’ (Fig. 1). Here, they follow the cyclonic wind-driven Weddell Gyre, 
where they merge with icebergs calving from the Filchner-Ronne Ice 
Shelf in the southern Weddell Sea. These icebergs then travel north 
through the so-called ‘Iceberg Alley’ and exit the Weddell Sea basin to 
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the northeast into the Scotia Sea (Fig. 1). Around 55° S-60 °S, icebergs 
start to move in a more eastward direction as they are swept into the 
warmer Antarctic Circumpolar Current. Melt rates remain low until 
the warmer Antarctic Circumpolar Current is reached, after which the 
icebergs ablate rapidly in the general area of our core sites. 

We present two high-resolution records of IBRD from deep-sea core 
sites MD07-3133 and MD07-3134, collected from the south-central 
Scotia Sea (Fig. 1) (Methods). Dating relies mainly on the correlation of 
dust records from our cores to the Antarctic European Project for Ice 
Coring in Antarctica (EPICA) Dronning Maud Land (EDML) dust 
record’? (Methods). Ages reported here refer to the EDML1 or EPICA 
Dome C (EDC) 3 age scale (Methods). The cores are located in the centre 
of Iceberg Alley, the primary route taken by large icebergs from the AIS 
that remained entrained in the Antarctic Coastal Current today as well 
as during the last deglaciation (Methods). Owing to the loss of some 
icebergs northwards from the Antarctic Coastal Current, and the grad- 
ual ablation of icebergs within the Antarctic Coastal Current, relatively 
fewer icebergs may reach our sites from more distant sources (Methods). 
Although the primary IBRD content of icebergs may vary substantially 
and most of the coarser basal material is released close to the source 
(that is, the coarse-grained IBRD signal may havea regional bias towards 
the Weddell Sea and the Antarctic Peninsula), the majority of the fine- 
grained englacial material determined here should capture a spatially 
more integrated signal that is representative of variability in the flux of 
icebergs released by the AIS (except for the Amundsen Sector), parti- 
cularly in a colder climate than today’s. 

The IBRD record from site MD07-3134 exhibits a number of peaks 
starting at about 20,000 years (20 kyr) ago and continuing through the 
deglaciation until about 9 kyr ago, when modern IBRD levels were reached 
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(Fig. 2c). Most of the pronounced IBRD events are replicated at site MD07- 
3133 (Fig. 2d), which is about 300 km to the northwest (Fig. 1), thus 
providing a robust signal of episodic enhanced iceberg flux through 
Iceberg Alley. Stacking the IBRD records (Fig. 2e) and their fluxes (Fig. 2f) 
(Methods) identifies eight primary episodes of enhanced iceberg flux 
originating from different parts of the AIS that began within one to sev- 
eral decades and lasted from centuries to a millennium. These records 
thus provide the first evidence for substantial centennial to millennial 
variability in mass loss from the AIS throughout much of the last degla- 
ciation. We refer to these events as Antarctic Ice Sheet discharges (AID8 
to AID1, from oldest to youngest); for error estimates on the timing and 
duration of AIDs see Methods. 

AID8 occurred 20-19 kyr ago (Fig. 2), which corresponds to the start 
of the deglaciation of the Antarctic Peninsula Ice Sheet’ and the East 
AIS in the southeastern Weddell Sea*. Although the IBRD event in the 
stacked record has a concentration of debris similar to that of several of 
the younger events, its flux (Fig. 2f) is the smallest, which is consistent 
with an early onset of deglaciation for just the Antarctic Peninsula and 
Weddell Sea sectors in response to increasing Southern Hemisphere 
obliquity and sea-level forcing originating from the onset of deglacia- 
tion of Northern Hemisphere ice sheets’. 

Subsequent AID events generally have similar or larger concentra- 
tion in the IBRD stack than does AID8 (Fig. 2e), whereas the IBRD fluxes 
of AID7 to AID1 exceed those of AID8 (Fig. 2f). AID7 started 17 kyr ago, 
with IBRD increasing until around 15.7 kyr ago, when the event abruptly 
ended. The distinct IBRD peak near the end of AID7 about 16 kyr ago has 
been documented as a second retreat phase of the Weddell Sea sector of 
the East AIS*. Moreover, most of the marine-based sectors of the Ant- 
arctic Peninsula Ice Sheet had retreated by about 16.7 kyr ago’, along 
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with sectors of the West AIS (for example, the Pine Island glacier of the 
Amundsen Sea embayment) at 16.4 kyr ago (Methods) that are generally 
coincident with the major increase in the Scotia Sea IBRD flux rates 
starting at 16.4 kyr ago (Fig. 2f). 

The start of AID7 follows the onset of Antarctic-wide deglacial warming”* 
(Fig. 3f) and the retreat of sea ice’* (Fig. 3e). As indicated by a decrease 
in the stable carbon isotope ratio of atmospheric carbon 5'°Cytm (ref. 14; 
Fig. 3c), enhanced Southern Ocean opal flux (Fig. 3d), and declining 
dust/magnetic susceptibility contents (Fig. 2a,b), the onset of AID7 
is accompanied by enhanced Southern Ocean ventilation and changes 
in atmospheric circulation that may have caused an encroachment of 
warmer water to the AIS marine margins, leading to destabilization 
and a period of enhanced iceberg release. 

There is a distinct pause in ice rafting between AID7 and AID6 (Figs 2, 3) 
accompanied by a slowdown in atmospheric warming in the Atlantic 
sector. The subsequent AID6 exhibits the largest IBRD flux (Fig. 3a and b). 
AID6 has a relatively abrupt onset at about 15 kyr ago and reaches its 
peak values approximately 14.8-14.4 kyr ago, followed by a more gradual 
decline until 13.9 kyr ago, when it abruptly ends. There is general agree- 
ment among the few existing well-dated records from Antarctica for wide- 
spread retreat of the AIS at this time, including from Mac Robertson 
Land of the East AIS, the Ross Sea sector of the West AIS, and the Ant- 
arctic Peninsula Ice Sheet”* (Methods). 

The interval of peak IBRD flux during AID6 (14.8-14.4 kyr ago + 
0.25 kyr, where the error is 1a) is synchronous with the onset of the 
Bolling interstadial in the Northern Hemisphere (14.64 + 0.09 kyr ago) 
(Fig. 3g) and with the period of rapid sea-level rise referred to as melt- 
water pulse (MWP) 1A (occurring 14.65-14.3 kyr ago + 0.03 kyr ago)”. 
Fingerprinting studies using the new Tahiti sea-level record and other far- 
field records’* support previous such studies’* in identifying a substantial 
(at least 50%) Antarctic contribution to the global mean sea-level rise of 


about 14 m during MWP-1A. The fact that the peak flux of AID6 in our 
well-dated IBRD record is the same age and duration (around 400 
years) as MWP-1A (as defined from the Tahiti record) provides direct 
evidence for an Antarctic contribution. 

Several factors may have preconditioned or triggered the initial AIS 
grounding-line (boundary between the floating ice shelf and the grounded 
ice that rests on bedrock) retreat during AID6, including sea-level rise 
from melting of Northern Hemisphere ice sheets’’, subsurface warm- 
ing associated with changes of the Atlantic Meridional Overturning 
Circulation’, and increases of atmospheric greenhouse gas concentra- 
tions. To further explain the climate response to the meltwater discharge 
associated with AID6, we conducted a series of transient numerical mod- 
elling experiments with three different Earth system models (Methods). 
Assuming that AID6 freshwater forcing corresponded to a maximum 
sea-level rise of 20 mm yr 1 (50% of the maximum MWP-1A rate’’), 
our model results show considerable Southern Ocean surface cooling, 
expansion of sea ice, formation of a halocline, reduction of Antarctic 
Bottom Water formation and subsurface warming at depths of 100- 
1,500 m through poleward migration of Circumpolar Deep Water (Fig. 4). 
Consistent with previous modelling experiments'*”’, we suggest that 
this ocean thermal forcing accelerated thinning of ice shelves by basal 
melting, inducing a positive feedback by causing grounding-line retreat, 
calving and subsequent release of more fresh water. 

The next five IBRD events (AID5-AID2), which are of comparable 
concentration and duration, show a pacing with a repeat time of about 
800-900 years. AID2 has the largest IBRD flux of these five events, attain- 
ing its peak flux at about 11.3 kyr ago, which corresponds to the age of 
MWP- 1B recorded in the Barbados sea-level record’’. The absence of 
an equivalent sea-level event in the Tahiti coral record’ might reflect 
different far-field sea-level responses to a deglaciation event from the 
Pacific sector of the AIS”. Although our IBRD record does not distinguish 


Figure 3 | IBRD flux in the Scotia Sea compared 
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to climate changes during the last deglaciation. 
a, Stacked IBRD flux record. b, 500-yr averages of 
8S stacked IBRD flux relative to Holocene average 
So (Methods). c, Antarctic deglacial 3)? Catm Stack 
=e with 1o error (shaded)"*. d, Biogenic opal flux 
records from the southeast Atlantic site TN057-14 
(dashed)”* and the Scotia Sea sites MD07-3133 
(black) and MD07-3134 (grey)”’. e, EDML ice-core 
-46 record of sea-salt Na* flux"’. f, 5!8O record from 
R % EDML ice core’. g, 8180 record from NGRIP ice 
~9o core*®’. Note that MWP-1A is coeval with AID6, 
50 within the dating uncertainties. Note further that 
Ee major changes in the Southern Hemisphere 
commenced around 17 kyr ago when the Northern 
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the source of mass loss from the AIS, the presence of AID2 is consistent 
with an Antarctic contribution to MWP-1B as well’. 

The youngest IBRD event (AID1) started about 10 kyr ago, with the 
peak flux occurring around 9.5 kyr ago; it ended about 9 kyr ago. The 
only other available records that span this time are from the Ross Sea 
sector of the Western AIS, but they suggest that retreat of the ground- 
ing line in the Ross Sea occurred either before or after AID1. The specific 
sector of the AIS that contributed to this event thus remains unknown 
(Methods). 

Our IBRD record shows clear evidence for episodic mass loss of the 
AIS during the last deglaciation, but further work is required to identify 
the sources of mass loss associated with each AID (Methods). Millennial- 
scale AIDs, similar to those observed in the Northern Hemisphere’, are 
indicative of abrupt ice-sheet instabilities” that initiated within one to 
a few decades, released fresh water into the Southern Ocean, and con- 
tributed to the overall deglacial sea-level rise. The peak of AID6 is syn- 
chronous, within the dating uncertainties, with MWP-1A. Modelling 
results indicate that ocean thermal forcing from the transport of warm 
Circumpolar Deep Water to the base of ice shelves may have been an 
important positive feedback in AIS deglacial dynamics. Highlighting the 
role of ocean thermal forcing and rapid ice-sheet responses, our study 
provides an important palaeoclimate context for contemporaneous~** 


and future*>”® interactions between the ice sheet and the ocean around 
Antarctica. 
METHODS SUMMARY 


Chronology. Age models for cores MD07-3133 and MD07-3134 (ref. 12) were 
established by correlating their magnetic susceptibility signal, as well as the Ca and 
Fe records of MD07-3134, both of which largely reflect dust deposition in the circum- 
Antarctic region, to the dust (non-sea-salt Ca) flux record of the EDML ice core”. We 
assume that sediment-core magnetic susceptibility, Fe, Ca and ice-core non-sea- 
salt Ca are in phase because all signals are atmospheric, and therefore essentially 
reached marine and ice-sheet deposition sites synchronously. Sites MD07-3133 
and MD07-3134 have average sedimentation rates of 92 cm kyr *and63cmkyr * 
over the last 36 kyr and 93 kyr, respectively, yielding a deglacial sample resolution 
for IBRD of 5-15 yr (Methods). 

Sample analysis. IBRD was counted every centimetre on X-radiographs taken 
from 1-cm-thick slices that were cut out from the centre of each core segment and 
exposed to an X-ray system (Methods). Biogenic opal was determined using a 
combination ofa traditional leaching method and newly introduced Fourier trans- 
form infrared spectroscopy. 

Earth system modelling experiments. Coordinated Southern Ocean freshwater 
perturbation experiments were conducted with the Bern3D and LOVECLIM Earth 
system models of intermediate complexity and the coupled general circulation model 
COSMOS (Methods). The IBRD flux stack (Fig. 3a) is used as a Southern Ocean 
freshwater forcing time series for these models, assuming that 50% of the MWP-1A 
forcing originated from Antarctica, and attaining peak values of about 0.22 Sv at 
14.67 kyr ago, coincident with the onset of the Bolling interstadial (14.64 kyr ago) 
and MWP-1A (14.65 kyr ago). 
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Figure 4 | Three-dimensional pattern of 
temperature anomalies at 14.8-14 kyr ago 
(AID6). The map shows mean surface air 
temperature anomaly; the vertical profile indicates 
zonally averaged ocean temperature anomaly in 
response to an Antarctic meltwater pulse of about 
0.22 Sv at around 14.6 kyr ago. The multi-model 
mean is calculated by averaging the respective 
anomalies obtained from the Bern3D, LOVECLIM 
and COSMOS models (Methods). White-and- 
purple dashed and purple solid lines show the 
annual mean extent of 20-cm-thick sea ice for 
14.8 kyr ago and 14.6 kyr ago, respectively, 
simulated by the LOVECLIM model. Note that 
freshwater forcing (a) causes surface cooling, an 
increase in zonally averaged sea-ice extent (b) and 
averaged subsurface warming between 800 m 

and 1,200 m and 63 °S and 70 °S (c). NADW, 
North Atlantic Deep Water; CDW, Circumpolar 
Deep Water. 
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METHODS 

Material and sediment composition. If not tabulated here, data from this study 
are available at the Pangaea data library (http://dx.doi.org/10.1594/PANGAEA. 
819646). Sediment cores MD07-3133 (57° 26’ S, 43° 27’ W; 3,101 m water depth; 
32.8 m core length) and MD07-3134 (59° 25’ S, 41° 28’ W; 3,663 m water depth; 
58.2 m core length) were obtained from the northern end of Dove Basin and Pirie 
Bank, respectively, in the southern part of the central Scotia Sea (Fig. 1). Both sites 
are located well below the carbonate compensation depth, that is, the sediments are 
largely carbonate-free and consist primarily of biogenic opal and detrital material. 
The relative proportions of these two components depend on climate conditions 
(see below): warm periods are associated with homogenous, olive grey to yellowish 
diatomaceous oozes, whereas cold periods correspond to grey to blue-grey diatom- 
bearing mud, and climate transitions are associated with olive-grey diatomaceous 
mud”. 

Non-destructive measurements. We conducted a number of non-destructive 
measurements”. We determined magnetic susceptibility (kappa volume specific) with 
a Bartington coil sensor for whole-core and a point sensor for split-core measure- 
ments at 1-cm increments using a multi-sensor core logger*’. We also measured the 
distribution of chemical elements using an AVAATECH X-ray fluorescence core 
scanner”. Results for Ca are given as peak area intensities determined in total counts 
per second. We performed the analysis using a sample spot of 1 cm X 1 cm in size 
at 1-cm increments”. 

Wealso used a Minolta spectrophotometer CM-2002 to measure L*, a* and b* 
as well as RGB colour components™ at 1-cm increments. RGB colour values were 
converted to R/G/B” and plotted with PanPlot (www.pangaea.de/software/PanPlot) 
to obtain visual colour logs (Fig. 2). 

Biogenic opal determination. Biogenic opal was determined by several methods 
for sites MD07-3133 and MD07-3134 (ref. 29). We first analysed biogenic opal by 
leaching the sediment in 1 M NaOH solution” to obtain ground-truth information 
at critical transitions (MD07-3133: 1,655-2,785 cm, 112 samples; MD07-3134: 745- 
1,785 cm, 4,135-4,585 cm; 141 samples). We then implemented a new and fast 
technique that relies on this ground-truthing, called Fourier transform infrared 
spectroscopy”, to obtain information on biogenic opal at 10-cm increments over 
the entire core lengths of sites MD07-3133 and MD07-3134 (316 and 575 samples, 
respectively). Note that biogenic opal flux rates in Fig. 3d are normalized for *°°Th 
to account for sediment focusing with the exception of MD07-3133, where no 
?3°Th data have been obtained so far”. 

Chronology. The low-resolution chronology of sites MD07-3133 and MD07-3134, 
which extend back to 36 kyr ago and 93 kyr ago, respectively, is described in ref. 12. 
Itis based on several ground-truth data points associated with Marine Isotopic Stage 
boundaries, the correlation to rather massive layers rich in coarser-grained detritus”*, 
and the identification of the Laschamp event. The high-resolution chronology 
relies on dust-climate couplings between Southern Ocean sediment and the Ant- 
arctic EDML ice core'’. The use of marine Fe records as a proxy for dust is a well- 
established approach reflecting coherent and synchronous changes in dust deposition 
across much of the Southern Ocean and the AIS, and has recently also been imple- 
mented to construct age models for records from the Atlantic’ and the Pacific’ 
sectors of the Southern Ocean to reconstruct Southern Hemisphere dust climate 
couplings during the Late Quaternary using Fe counts. In addition, a number of pre- 
vious workers have identified a remarkable similarity in the structure of variability 
of magnetic susceptibility in cores from the Scotia Sea and the variability in dust 
records from Antarctic ice cores*!’. Pugh et al. noted this similarity in magnetic 
susceptibility records from three cores in the Scotia Sea (that is, the same region as 
our records cover), and explicitly tested whether the magnetic susceptibility signal 
was synchronous with the EPICA ice-core dust record. They first established two 
independent age models for their cores from (1) biostratigraphy (variations in 
Cycladophora davisiana abundance tied to the LR04 time scale 5'°O global stack“*) 
and (2) radiocarbon dating, and then compared those age models to one constructed 
by synchronizing their magnetic susceptibility records to the ice-core dust record. 
They concluded that these three independently derived age models are “mutually 
consistent over their common ranges” and that millennial-scale fluctuations in mag- 
netic susceptibility and in Antarctic dust concentration were synchronous, supporting 
our strategy of tuning Scotia Sea magnetic susceptibility records to Antarctic ice- 
core dust records to develop an age model for our marine records’. 

Weber et al.'? also noted a remarkable similarity in the structure of variability of 
Ca in the cores from the Scotia Sea and the variability in the non-sea-salt Ca records 
from Antarctic ice cores. Weber et al.” measured Ca by X-ray fluorescence which, 
because the Scotia Sea sites do not contain biogenic Ca, is the same proxy for dust as 
used in the ice cores (non-sea-salt Ca), and found that when Ca is tuned to non- 
sea-salt Ca (which is completely independent of the magnetic susceptibility tuning), 
the resulting correlation between the two is as strong as the correlation of magnetic 
susceptibility to non-sea-salt Ca, providing additional support for synchroneity, 
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because both marine and ice-core Ca records are proxies for dust, with the most 
likely physical process being short-lived atmospheric transport. 

Given these independent lines of evidence supporting synchronous changes, we 
tuned magnetic susceptibility and Ca in our cores to the non-sea-salt Ca flux record 
from the EDML ice core’. We used three statistical approaches to produce a tuned, 
high-resolution age model, evaluate its quality and provide error estimates'”. Imple- 
menting dynamic time wrapping algorithms, for instance, helped to detect circula- 
rities by computing a time (depth) axis stretch and mapping two time series optimally 
by minimizing the cumulative distance without any pre-defined tie points. The 
resulting age—depth structure obtained from dynamic time wrapping was virtually 
identical to the one obtained by the tuning procedure, which served as indepen- 
dent verification of our final tuning with tie points. 

Our deglacial age model is based primarily on five stratigraphic tie points (TP1 
to TP5) that are shared between the records and reveal a detailed record for the 
period about 20-8 kyr ago over 10 mand7 mcore lengths for sites MD07-3133 and 
MD07-3134, respectively (Extended Data Fig. 1). The five tie points correspond to 
consistent breaks in slopes and characteristic and reproducible peaks in the records 
of magnetic susceptibility and Ca relative to non-sea-salt Ca. Our deglacial chro- 
nology also includes Fe, which shows the same characteristic trends and features as 
magnetic susceptibility and Ca, and which has been used to synchronize Southern 
Ocean marine sediment and Antarctic ice-core records*”*®. TP1 (about 19 kyr ago) 
is one of many correlation points for the Last Glacial Maximum used in ref. 12. It 
marks the end of distinctly elevated values in magnetic susceptibility, Ca, Fe and 
non-sea-salt Ca towards the end of this phase. TP2 (about 17.6 kyr ago) is defined 
on the basis ofa steep increase in slope in each of the records. A subsequent decrease 
in slope in each of the records is the basis for TP3 around 16.5 kyr ago. TP4 occurs 
another 1,800 years later with another distinct decrease in slope in each record 
around 14.7 kyr ago and a corresponding distinct peak in the non-sea-salt Ca and 
magnetic susceptibility records. TP5 is recorded in all archives as a distinct peak 
that occurred around 8.4 kyr ago. 

In addition to these common tie points, site MD07-3134 and EDML share a 
dust peak at 12.85 kyr ago that is not preserved at site MD07-3133. Sites MD07- 
3133 and MD07-3134 share a weak dust peak at 11.42 kyr ago, which was taken as 
an additional age control for MD07-3133. This correlation is supported by the fact 
that colour component a* (red-green variability) shows a distinct peak at 8.5-8.3 kyr 
ago at sites MD07-3133 and MD07-3134, which also serves as control point at both 
sites (Fig. 2c, d). This peak is replicated at 11.42 kyr ago (that is, at 19.00 m at site 
MD07-3133, and at 11.00 m at site MD07-3134). 

Based on our age model, sedimentation rates in our cores are high, 0.2-1.2 m kyr! 
for site MD07-3134 (63 cm kyr”! on average over the last 93 kyr) and 0.3-2.1 mkyr! 
for site MD07-3133 (92 cm kyr’ ' on average over the last 36 kyr), translating into 
mass accumulation rates of 10-120 gcm* kyr ' and 15-170 gem” kyr’ ', and into 
sample resolutions of 8-50 yr and 5-33 yr (given a sample increment of 1 cm) for 
IBRD counting, respectively. 

Considering the chronological limitations and problems that are associated with 

most Southern Ocean sediment archives, the age model of our two Scotia Sea sites 
has a high resolution and should be very accurate because the uncertainties assoc- 
iated with 4C dating, such as reservoir age, are circumvented. Therefore, our expanded 
deglacial record of about 20-8 kyr ago (documented over 7 m and 10 m core length in 
sites MD07-3134 and MD07-3133, respectively) is chronologically better constrained 
than the deglacial signals that have been inferred from shallow-water archives around 
the Antarctic continent, such as at Mac Robertson Land’, where inferred changes 
usually occur over a few centimetres in condensed, lithologically complex succes- 
sions with partially reversed ‘*C ages that have uncertain reservoir ages. 
Age scale and AID uncertainty estimates. We use the EDML1/EDC3 age model 
that was developed in conjunction with the EDC age model’’ by incorporating 
ground-truth stratigraphic evidence, most importantly the correlation of regional 
volcanic ash layers. Based on this age model, three AIDs in the Scotia Sea correlate 
well with large-scale events: the peak of AID7 with the major retreat of the Weddell 
Sea part of the East AIS at 16.1 kyr ago’, the peak of AID6 with MWP-1A around 
14.7 kyr ago, and AID2 with MWP-1B at 11.3 kyr ago. 

The original age scale EDML1/EDC3 relied on ash correlation to EDC**””. The 
errors of this correlation are a couple of decades. For ages younger than 41 kyr ago, 
the EDML1/EDC3 age model is based on correlation of methane and '°Be to the Green- 
land Ice Core Chronology 2005 (GICC05), which largely relies on layer counting”. 
The errors of GICC05 were also a couple of decades for the Holocene, and up to 
two centuries for the time around MWP- 1A. Subsequent revision to the EDML age 
model” and the latest Antarctic-wide ice-core chronology”, called AICC2012, tried 
to account for regional stratigraphic constraints and the need to adjust ages to achieve 
interhemispheric methane and '°Be synchronization during deglaciation. 

AICC2012 shows little to no difference to EDML1/EDC3 for most of the Holo- 
cene: that is, the timing of AID1 and AID2 are not affected. However, from about 
8-18 kyr ago, AICC2012 becomes progressively older, with an age difference peaking 
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at about 550 yr at 18 kyr ago, and declining again to about 200 yr from the Last 
Glacial Maximum. For the time of MWP- 1A, for instance, the age difference is almost 
400 yr. This would increase the age of the tie point, which occurs right at the start 
of AIE6, from 14.8 kyr ago to almost 15.2 kyr ago, implying that the peak of AIE6 
dates 15.1-14.7 kyr ago instead of 14.8-14.4 kyr ago. The entire AIE6 would hence 
be 15.2-14.2 kyr ago. 

However, this age shift is only achieved at the expense of a significantly larger 
dating uncertainty than the original EDML1/EDC3 chronology. For the Holocene 
the uncertainty is around 200 yr (ref. 51). At about 12 kyr ago, it increases to almost 
500 yr, and it reaches 1,000 yr by about 19 kyr ago. The original EDML1/EDC3 age 
scale, however, has much lower uncertainties. From approximately 10 kyr ago to 
14 kyr ago, uncertainties increase from only a few decades to around 200 yr, respec- 
tively (Extended Data Fig. 2d), and then further increase to about 700 yr at around 
18 kyr ago. One should note that, although AICC2012 is applied to all Antarctic ice 
cores, the only ground-truth correlation of EDML is established via volcanics to EDC. 

Synchronization of the new West AIS ice core to EDML via layer counting between 
volcanic acidic horizons, which has not been included in AICC2012, indicates that 
AICC2012 is significantly too old for the pre-Holocene time (Winstrup, M., Vinther, 
B. M., Sigl, M., McConnell, J., Svensson, A. M. and Wegner, A., manuscript in prep- 
aration). For the depth around MWP-1A, for instance (about 800 m at EDML), 
this new chronology will again be closer to the timing of the original EDML1/EDC3 
timescale. Also, the uncertainly will again be much lower, given the low counting 
error of annual bands between volcanic ashes. 

Since the ageing of AICC2012 is only achieved at the expense of much higher 
uncertainties relative to the EDML1/EDC3 age scale, and all shifts are within the 
uncertainties of the different ice-core age models, we use the original EMDL1/EDC3 
age model. We note also that its dating uncertainties are lower and that it has more 
ground-truth chronological constraints. 

The original uncertainty estimates of the Scotia Sea sites’* built upon a bootstrap 

algorithm that simulates possibly wrong tie points and therefore provides conser- 
vative error estimates. However, these uncertainty estimates do not include ice-core 
age uncertainties. We used the resulting dating errors (2c) of the tie point correla- 
tion to calculate uncertainties for the start, end and duration of AIDs (Extended 
Data Fig. 2a). In addition, we considered the uncertainties in the EDML1 chronology” 
as well as the EDC3 chronology** when calculating the full uncertainties for the 
AIDs (Extended Data Fig. 2b). All uncertainties are incorporated into the absolute 
dating uncertainty (2c error) of the AIDs (Extended Data Table 1). The bootstrap 
algorithm allows for the derivation of relative dating uncertainties for the AIDs. 
Since interpolation errors of nearby layers are highly correlated, these uncertainties 
are naturally smaller than the absolute dating uncertainties (Extended Data Fig. 2c). 
Hence, significantly more precise estimates can be made on the duration of and the 
distance between AIDs. 
IBRD counting and conversion. We cut out 1-cm-thick, 10-cm-wide and 25-cm- 
long slices from the centre of each core section, placed them on plastic plates and 
exposed the slices to film-sensitive negatives for 3 min to 5 min using a HP 43855 
X-ray system. The negatives were then scanned at 300 dpi (dots per inch) and ana- 
lysed for the amount of IBRD using a 1-mm by 1-mm grid. 

We counted all visible grains >1 mm in diameter per centimetre core depth* (that 
is, per 10 cm? volume) as an indicator for the content of IBRD (Extended Data Fig. 3). 
We distinguished between grains 1-2 mm and >2 mm in diameter. Also, to account 
for operator subjectivity in IBRD counts, three different individuals conducted inde- 
pendent counts, which were then averaged for Fig. 2c and d. For Fig. 2e, we stacked 
the results by re-sampling each data series at 10-yr increments, adding the individual 
counts from the two cores, and then dividing them by 2. For Fig. 2f, we con- 
verted the IBRD time series from sites MD07-3133 and MD07- 1334 into flux rates, 
using the detailed age models”, and, again, stacked the results. For Fig. 3b, we averaged 
the stacked IBRD flux rates into 500-yr slices. 

The stacked IBRD flux-rate record indicates, on average, rather stable values 
with little multi-centennial variability of 0.01 grains cm” yr” ' for 25-21 kyr ago, and 
0.015 grains cm’ yr’ for 8-0 kyr ago. We infer that these low values represent steady- 
state conditions, that is, the amount of iceberg calving off the Antarctic continent is 
balanced by accumulation on the ice sheet. During deglaciation, however, flux rates 
were distinctly above these low values, and 10-15 times higher during meltwater 
pulses. Even when averaged into 500-yr slices (grey histogram line, Fig. 3e), melt- 
water pulses show IBRD flux rates 4-7 times higher. 

We note that icebergs usually lose the majority of their coarse-grained IBRD (peb- 
bles to boulders) immediately after calving proximal to the AIS grounding lines. 
The largest IBRD reaching our distal core positions in Iceberg Alley is usually 
1-3 mm in size and dispersed in a diatom-rich or clay-rich matrix (Extended Data 
Fig. 3). These particles embedded within icebergs can travel over long distances 
and argue against a strong regional bias in our IBRD data set. 

On the relation of AIS mass loss, icebergs and IBRD. The majority of IBRD 
originates from bedrock, either directly (through scouring) or indirectly (through 


refreezing of sediment-laden water at the bed), and is entrained in the deeper part 
ofa glacier. Where glaciers merge, medial moraines can cause IBRD to occur over a 
much greater thickness, sometimes even throughout the ice column (W. F. Budd, 
personal communication, 2013). Other processes internal to ice flow (for example, 
compressive flow) can also transport debris from the bed well up into the ice column. 
Only a small fraction will be delivered onto the ice surface, for example, by rock fall, 
or by wind transport in the case of finer grains. 

In the floating ice shelves most IBRD near the base will melt out relatively close to 
the source***’, immediately in front of the grounding line. In ice shelves with high 
melt rates, the whole sediment-rich basal layer might ablate. However, in some ice 
shelves tributary glaciers can carry IBRD straight into a basal freeze-on zone, where 
they are protected from melting out by a layer of meteoric ice (for example, the west 
side of the Amery; see figures 6a and 9a of ref. 55). This process and the poten- 
tial for localized IBRD throughout the ice column allow some IBRD to reach the 
front of large ice shelves. 

The fact that we capture mostly small, englacial IBRD in Iceberg Alley indicates 
that the majority of the coarse-grained IBRD has been deposited before the icebergs 
reached the Scotia Sea sites. The fact that englacial material can travel long dis- 
tances and will probably survive the counterclockwise journey around Antarctica 
(see below) within the Antarctic Coastal Current could therefore indicate that 
regional bias is not strong and no large nonlinearities exist between the amount 
of entrained IBRD detected in the Scotia Sea and the number of icebergs calving off 
the AIS. 

Iceberg Alley today and during deglaciation. The term ‘Iceberg Alley” refers to 
the confluence zone where Antarctic icebergs merge” and exit the Weddell Sea to 
the north into the Scotia Sea”. Icebergs take several months to two years to com- 
plete their journey counterclockwise around Antarctica’. Although icebergs lose some 
mass in transit (through wave erosion, friction or collisions with sea ice), it is a rapid 
journey in cold waters. Therefore, melt rates remain low until the warmer Antarctic 
Circumpolar Current is reached, after which the icebergs ablate rapidly”’. This is one 
of the reasons why the sites from Iceberg Alley should contain a circum-Antarctic- 
wide IBRD signal, although it is also reasonable to assume that the Weddell Sea 
part of the East AIS and the Antarctica Peninsula Ice Sheet provide more IBRD than 
more distal sources. This regional bias has been detected before in IBRD provenance 
studies, although deglaciations may involve different iceberg fluxes from various 
sources at different times as well as longer transport distances due to colder climates”. 

A substantial percentage of large icebergs reach Iceberg Alley through the Ant- 
arctic Coastal Current''. Some of the smaller icebergs (for example, in the vicinity 
of the Kerguelen plateau) may stray directly north due to topographic steering of 
ocean currents, but even the smaller icebergs mostly stay entrained in the Antarctic 
Coastal Current. Most icebergs calving from the Western AIS are not likely to reach 
Iceberg Alley through the Antarctic Coastal Current because there is a small north- 
wards and clockwise escape route from the coastal drift (see arrow in Fig. 1; refs 9, 11). 
However, it is conceivable that these icebergs could also come through the Scotia 
Sea and affect the IBRD record of our sites. 

Several factors indicate that the present-day iceberg routeing through Iceberg 
Alley would have remained largely the same during the last deglaciation. The Ant- 
arctic Divergence is the major driver of the ocean—atmospheric circulation in the 
Southern Ocean. It forces winds north of it to blow east and to feed clockwise into 
the Antarctic Circumpolar Current. South of the Antarctic Divergence, winds blow 
west and feed counterclockwise into the Antarctic Coastal Current around Antarctica. 
The Antarctic Circumpolar Current is constrained to flow south of the tip of South 
America at 56° S as it passes from the Pacific Ocean to the Atlantic Ocean, and its 
mean position therefore lies between 50° S and 55° S (ref. 61). The Antarctic Circum- 
polar Current and Antarctic Coastal Current will thus exist under different climates 
of the past**”, although specific fronts associated with them probably shifted north 
or south, depending on the state of the climate and its associated sea-ice coverage. 

An additional factor that is important in routeing of icebergs discharged from 
the AIS is the Coriolis force, which depends on the rotation of the Earth and is there- 
fore independent of the state of the climate. It causes all moving objects to be deflected 
to the left in the Southern Hemisphere. Thus, all icebergs calving from the AIS are 
forced to move counterclockwise and maintain a course parallel to the edge of the 
Antarctic continent. Moreover, the Coriolis force exerts a greater force on the larger 
icebergs”, which contribute most of the AIS mass loss, and they thus largely remain 
entrained in the Antarctic Coastal Current (that is, sea ice does not have a large 
effect on altering the course of big icebergs); preferably smaller icebergs and crawlers 
would have escaped the Antarctic Coastal Current during times of thicker sea-ice 
coverage further away from the continent. 

Finally, several studies*® have reconstructed extensive coastal polynyas (areas 
of open water surrounded by sea ice) immediately in front of the AIS during colder 
periods when sea-ice coverage was dense further away from the continent. This fur- 
ther implies a vigorous Antarctic Coastal Current also during glacial and deglacial 
times. In summary, there are several lines of evidence that iceberg routeing through 
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Iceberg Alley operated in much the same way during deglaciation as is observed and 
modelled today. 

We note that Kanfoush et al.** detected layers rich in coarser-grained detritus 
across the southern Atlantic, and named them SAO (about 14-15 kyr ago) to SA6 
(about 55 kyr ago). Whereas they interpreted these layers as IBRD and also inferred 
episodes of Antarctic ice-sheet instability from them, subsequent studies™ demon- 
strated that the material was volcanic ash, most probably from the South Sandwich 
Islands, and concluded that the ash was deposited on sea ice, which then trans- 
ported the material. In any event, the majority of Antarctic icebergs are entrained 
into the Antarctic Circumpolar Current in Iceberg Alley and melt long before these 
distal South Atlantic sites can be reached*’. Moreover, the layers described by 
Kanfoush et al.** are distinct and massive, whereas our IBRD counts usually indicate 
small (1-3 mm in diameter) and dispersed grains embedded in a diatom- or clay- 
rich matrix. In addition, our IBRD records show no long-term correlation to the 
sea-salt Na record (Fig. 3f) of the EDML ice core, a proxy for sea ice. Accordingly, we 
are confident that the Scotia Sea IBRD counts indicate a source from icebergs with 
varying fluxes rather than a fallout from volcanic eruptions and transport by sea ice. 
Deglacial meltwater sources. During deglaciation, specific parts of the AIS con- 
tributed to specific retreat events, that is, ice-sheet retreat probably did not occur 
contemporaneously across the entire AIS. The Weddell Sea sector of the East AIS 
probably contributed to AID8 (MWP-19KA; ref. 1) and AID7, with evidence that 
ice-sheet retreat occurred 19 kyr ago and 16 kyr ago, respectively’. Early deglacia- 
tion of the Antarctic Peninsula® and the Lambert glacier-Amery ice shelf system 
may also have contributed icebergs to these events. In addition, the Pine Island gla- 
cier of the Amundsen Sea embayment retreated at 16.4 kyr ago”, and may have con- 
tributed to AID7. 

Evidence for widespread retreat 14-15 kyr ago comes from additional sectors 

of the East AIS, that is, Mac Robertson Land’, as well as the Ross Sea sector of the 
West AIS*® and the Antarctic Peninsula Ice Sheet*”®, suggesting multiple sources 
for AID6 (MWP-1A). Such a change in the distribution of meltwater sources is sup- 
ported by our freshwater experiments, which simulate increased subsurface tempe- 
rature responses when sources are shifted from west to east during deglaciation. 
Subsequent retreat during AID2 (MWP- 1B) is recorded at Wilkes Land, part of the 
East AIS”". Available terrestrial and shallow-marine records from the Ross Sea sector 
of the West AIS suggest retreat of the grounding line in the Ross Sea as having 
occurred either before” or after* AID1. Therefore, the specific sector of the AIS 
that contributed to this event remains unknown. 
Ice-sheet/ocean feedbacks during AID6. To study the response of the Southern 
Hemisphere climate system to AID6-related iceberg surges, a series of coordinated 
transient climate modelling experiments is conducted with a hierarchy of climate 
models using time-varying Southern Ocean freshwater forcing (Extended Data 
Table 1). The freshwater forcing, which is evenly applied to the climate models in 
the Weddell and Ross seas, is a linearly scaled version of the IBRD flux stack of 
Fig. 3a. The scaling factor is determined by assuming that 50% of the maximum 
sea level rates during MWP-1A (40 mmyr_')' originated from Antarctica. The 
corresponding peak freshwater forcing around Antarctica during AID6 reaches 
values of 0.22 Sv (Fig. 4). It should be noted here that the anomalous fresh water 
released from the Ross and Weddell seas is picked up by the Antarctic Circumpolar 
Current. This results in an efficient mixing of the freshwater around Antarctica, 
mimicking to some degree the effect of drifting icebergs’ and their corresponding 
freshwater fluxes. The multi-model approach allows us to determine the robust fea- 
tures of the climate response to an Antarctic iceberg discharge 14.8-14.0 kyr ago 
(Fig. 3). 

The following three climate models have been used: (1) The Bern3D model”*”* 
is an intermediate-complexity model based on an energy-balance model for the 
atmosphere and a frictional geostrophic model for the ocean. Other components 
included in the Bern3D model, such as a carbon cycle and sediment component, 
are not considered here. The horizontal resolution adopted here is 5° by 10°. Atmo- 
spheric winds are prescribed at present-day values, thus inhibiting large-scale atmo- 
spheric circulation changes in response to the climate change. The freshwater forcing 
is applied to the Bern3D model under climate boundary conditions that represent 
the Last Glacial Maximum. (2) The global climate model LOVECLIM uses a sim- 
plified dynamical atmosphere coupled to an ocean general circulation model, and 
dynamic/thermodynamic sea-ice and diagnostic vegetation models. It has been 
used in a number of palaeoclimate studies focusing on the Southern Ocean'*”*”*. 
Horizontal resolutions of the atmosphere and the ocean are 5.6° by 5.6° and 3° by 
3°, respectively. Here the Southern Ocean freshwater forcing is applied to constant 
climate boundary conditions that represent the climate around 15 kyr ago”. These 
initial conditions were obtained from 5,000-yr-long spin-up experiments that use 
15-kyr-ago greenhouse gas concentrations, orbital forcing and ice-sheet forcing. 
(3) COSMOS isa comprehensive coupled general circulation model including a land 
surface scheme and a dynamic sea-ice model. It has been extensively used to study 
various key palaeoclimate periods” *°. Horizontal resolutions of the atmosphere 
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and the ocean are 3.75° and 3°, respectively. The freshwater forcing is applied under 
15-kyr-ago boundary conditions. The initial state for the freshwater forcing experi- 
ment was obtained from a transient deglacial simulation varying CO, concentra- 
tions derived from EDC ice core* and orbital forcing. 

Mimicking the effects of AID6, the applied Southern Hemisphere freshwater 
forcing causes widespread surface cooling in the Southern Hemisphere and an 
increase of Southern Ocean sea-ice area (Fig. 4), in accordance with the sea-salt 
Na" flux data from EPICA Dronning Maud Land (Fig. 3e). This cooling would 
contribute to establishing the Antarctic Cold Reversal and the bipolar seesaw response 
during the Bolling—Allerod. Moreover, the freshwater forcing generates a strong 
halocline, which inhibits formation of very cold Antarctic Bottom Water, which is 
then replaced by considerably warmer Circumpolar Deep Water (Fig. 4). Follow- 
ing previous studies'*”, we hypothesize that the associated sub-surface warming 
at depths of 100-1,500 m can cause ice-shelf melting, grounding-line retreat, calv- 
ing and subsequent release of more fresh water, all of which act as positive feedbacks. 
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Extended Data Figure 2 | Uncertainty estimates for AIDs. Conservative 
error estimates (2c) rely on bootstrapping of different age models and 
projecting them on to AIDs. a, Errors of the MD age model’* based on tie point 
correlation only. Black dots depict the centre of the AID and its absolute 
uncertainty range. Black error bars at the boxes mark the relative uncertainty 
with respect to the centre. Grey error bars show the absolute uncertainty of the 
beginning and end of each AID. b, Errors including the EDML] (ref. 47), and 
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EDC3 (ref. 46) uncertainties. c, Relative duration of AIDs and related 
uncertainties. d, Error propagation of the three different age scales through the 
last deglaciation. IU is interpolation uncertainty. Note that uncertainties are 
highly correlated for nearby ages. Accounting for this correlation, the duration 
of each AID as well as the time between two AIDs is significantly more accurate 
than its absolute age uncertainty. 
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Extended Data Figure 3 | X-radiograph images from Scotia Sea Site higher numbers indicate enhanced iceberg routeing through Iceberg 
MD07-3134. IBRD (bright dropstones) are embedded in a matrix-supported Alley during three distinct deglaciation phases (centre panels): AID8 (MWP- 
diatomaceous mud. Low IBRD contents are documented for the Last Glacial | 19KA), AID7 and AID6 (MWP-1A). 

Maximum (LGM, ~24.7 kyr ago) and the Holocene (~8.8 kyr ago), whereas 
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Extended Data Table 1 | Uncertainty estimates 


AID Top Error (20) Centre Error (2c) Bottom — Error (2c) 
No. (kyr) (kyr) (kyr) 
AID1 8.99 (0.32, 9.63 (0.29, 10.28 (0.27, 
0.38) 0.38) 0.38) 
AID2 11.00 (0.26, 11.23 (0.25, 11.46 (0.25, 
0.39) 0.39) 0.39) 
AID3 11.78 (0.24, 11.90 (0.24, 12.03 (0.24, 
0.40) 0.41) 0.42) 
AID4 12.71 (0.22, 12.88 (0.22, 13.04 (0.22, 
0.42) 0.42) 0.42) 
AID5 13.28 (0.21, 13.46 (0.21, 13.65 (0.20, 
0.43) 0.43) 0.44) 
AID6 13.94 (0.19, 14.40 (0.17, 14.86 (0.16, 
0.44) 0.50) 0.55) 
AID7 15.75 = (0.13, 16.33 (0.13, 16.91 (0.13, 
0.67) 0.76) 0.85) 
AID8 19.00 (0.07, 19.54 (0.08, 20.08 (0.09, 
1.03) 1.04) 1.05) 


Top, bottom, and centre of deglacial AID1 to AID8, as well as absolute dating uncertainties in terms of the 2c error (parenthesis). The first uncertainty relates to the MDO7 chronology only??, the one below to the 
overall uncertainty including the EDML1 (ref. 47) and EDC3 (ref. 46) chronology. 
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Increasing CO, threatens human nutrition 


Samuel S. Myers', Antonella Zanobetti’, Itai Kloog®, Peter Huybers*, Andrew D. B. Leakey? , Arnold J. Bloom®, Eli Carlisle°, 
Lee H. Dietterich’, Glenn Fitzgerald’, Toshihiro Hasegawa’, N. Michele Holbrook'®, Randall L. Nelson!!, Michael J. Ottman!’, 
Victor Raboy”’, Hidemitsu Sakai’, Karla A. Sartor’, Joel Schwartz’, Saman Seneweera!°, Michael Tausz!® & Yasuhiro Usui? 


Dietary deficiencies of zinc and iron are a substantial global public 
health problem. An estimated two billion people suffer these defi- 
ciencies’, causing a loss of 63 million life-years annually~*. Most of 
these people depend on C, grains and legumes as their primary 
dietary source of zinc and iron. Here we report that C; grains and 
legumes have lower concentrations of zinc and iron when grown 
under field conditions at the elevated atmospheric CO, concentra- 
tion predicted for the middle of this century. C3 crops other than 
legumes also have lower concentrations of protein, whereas C, 
crops seem to be less affected. Differences between cultivars of a 
single crop suggest that breeding for decreased sensitivity to atmo- 
spheric CO, concentration could partly address these new chal- 
lenges to global health. 

In the 1990s, several investigators found that elevated atmospheric 
CO, concentration (hereafter abbreviated to [CO,]) decreased the 
concentrations of zinc, iron and protein in grains of wheat*’, barley” 
and rice* grown in controlled-environment chambers. However, sub- 
sequent studies failed to replicate these results when plants were grown 
in open-top chambers and free-air CO2 enrichment (FACE) experi- 
ments. A previous study’ found no effect of [CO,] on the concentra- 
tions of zinc or iron in rice grains grown under FACE and suggested 
that the earlier findings had been influenced by ‘pot effects’, by which a 
small rooting volume led to nutrient dilution at the root-soil interface. 
Of the more recent studies'*'*, most have indicated lower elemental 
concentrations in soybeans’, sorghum”’, potatoes"’, wheat” or barley”* 
grown at elevated [CO,], but with the exception of iron in one study on 
wheat’, these results were statistically insignificant, perhaps because of 
small sample sizes. 

Small sample sizes have limited the statistical power of individual 
studies of many aspects of plant responses to elevated [CO], and meta- 
analyses involving larger samples of genotypes, environmental condi- 
tions and experimental locations have been important in resolving 
which elements of plant function respond reliably to altered [CO.]’*””. 
A recent meta-analysis of published data concluded that only sulphur is 
decreased in grains grown at elevated [CO ]"*. 

Here we report findings from a meta-analysis of newly acquired data 
from 143 comparisons of the edible portions of crops grown at ambient 
and elevated [CO] from seven different FACE experimental locations 
in Japan, Australia and the United States involving six food crops (see 
Table 1). We tested the nutrient concentrations of the edible portions 
of rice (Oryza sativa, 18 cultivars), wheat (Triticum aestivum, 8 culti- 
vars), maize (Zea mays, 2 cultivars), soybeans (Glycine max, 7 culti- 
vars), field peas (Pisum sativum, 5 cultivars) and sorghum (Sorghum 
bicolor, 1 cultivar). In all, forty-one genotypes were tested over one to six 
growing seasons at ambient and elevated [CO ], where the latter was in 
the range 546-586 p.p.m. across all seven study sites. Collectively, these 


experiments contribute more than tenfold more data regarding both 
the zinc and iron content of the edible portions of crops grown under 
FACE conditions than is currently available in the literature. Consistent 
with earlier meta-analyses of other aspects of plant function under FACE 
conditions’*”’, we considered the response comparisons observed from 
different species, cultivars and stress treatments and from different years 
to be independent. The natural logarithm of the mean response ratio 
(r = response in elevated [CO,]/response in ambient [CO,]) was used as 
the metric for all analyses. Meta-analysis was used to estimate the overall 
effect of elevated [CO.] on the concentration of each nutrient in a parti- 
cular crop and to determine the significance of this effect (see Methods). 

We found that elevated [CO,] was associated with significant de- 
creases in the concentrations of zinc and iron in all C; grasses and le- 
gumes (Fig. 1 and Extended Data Table 1). For example, wheat grains 
grown at elevated [CO,] had 9.3% lower zinc (95% confidence interval 
(CI) —12.7% to —5.9%) and 5.1% lower iron (95% CI —6.5% to —3.7%) 
than those grown at ambient [CO,]. We also found that elevated [CO,] 
was associated with lower protein content in C3 grasses, with a 6.3% 
decrease (95% CI —7.5% to —5.2%) in wheat grains and a 7.8% decrease 
(95% CI —8.9% to — 6.8%) in rice grains. Elevated [CO,] was associated 
with a small decrease in protein in field peas, and there was no significant 
effect in soybeans or C, crops (Fig. 1 and Extended Data Table 1). 

In addition to our own observations, we obtained data from 10 of 11 
previously published studies investigating nutrient changes in the edible 
portion of food crops (Extended Data Table 6) and combined these data 
with our own observations in a larger meta-analysis. Analysis of our 
results combined with previously published FACE data (Extended Data 
Table 2), or combined with previously published data from both FACE 
and chamber experiments (Extended Data Table 3), was consistent with 
the results obtained using only our new data. Combining our data with 
previously published data did not alter the significance or substantially 
alter the effect size of the nutrient changes for any crop or any nutrient. 

In addition to nutrient concentrations, we also measured phytate, a 
phosphate storage molecule present in most plants that inhibits the 
absorption of dietary zinc in the human gut'’. We had noa priori reason 
to assume that phytate concentrations would change in response to 
rising [CO,]. However, formulae for calculating absorbed, or bioavail- 
able, zinc depend on both the amount of dietary zinc and the amount of 
dietary phytate consumed”, making it important to interpret changes 
in zinc concentration in the context of possible changes in phytate. 
Phytate content decreased significantly at elevated [CO] only in wheat 
(P< 0.01). This decrease might offset some of the declines in zinc for 
this particular crop, although the decrease was slightly less than half of 
the decrease in zinc. For other crops examined, however, the lack of a 
concurrent decrease in phytate may further exacerbate problems of zinc 
deficiency. 
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Table 1 | Characteristics of agricultural experiments 


Crops Country Treatments used Years grown Number of replicates Number of cultivars COz ambient/elevated (p.p.m.) 
Wheat 
Site 1 Australia 2 water levels, 2007-2010 4 8 382/546-550 
2 nitrogen treatments, 
2 sowing times 
Site 2 Australia 1 water level, 2007-2009 4 1 382/546-550 
1 nitrogen treatment, 
2 sowing times 
Field peas Australia 2 water levels 2010 4 5 382/546-550 
Rice 
Site 1 Japan 1 nitrogen treatment, 2007-2008 3 3 376-379/570-576 
2 warming treatments 
Site 2 Japan 3 nitrogen treatments, 2010 4 18 386/584 
2 warming treatments 
Maize United States 2 nitrogen treatments 2008 4 2 385/550 
Soybeans United States 1 treatment 2001, 2002, 2004, 4 7 372-385/550 
2006-2008 
Sorghum United States 2 water levels 1998-1999 4 1 363-373/556-579 


‘Number of replicates’ refers to the number of identical cultivars grown under identical conditions in the same year and location but in separate FACE rings. 


The global [CO.] in the atmosphere is expected to reach 550 p.p.m. 
in the next 40-60 years, even if further actions are taken to decrease 
emissions’*. At these concentrations, we find that the edible portions of 
many of the key crops for human nutrition have decreased nutritional 
value when compared with the same plants grown under identical 
conditions but at the present ambient [CO,]. Analysis of the United 
Nations’ Food and Agriculture Organization food balance sheets reveals 
that in 2010 roughly 2.3 billion people were living in countries whose 
populations received at least 60% of their dietary zinc and/or iron from 
C; grains and legumes, and 1.9 billion lived in countries that received at 
least 70% of one or both of these nutrients from these crops (Extended 
Data Table 5). Reductions in the zinc and iron content of the edible 
portion of these food crops will increase the risk of zinc and iron defi- 
ciencies across these populations and will add to the already considerable 
burden of disease associated with them. 

The implications of decreased protein concentrations in non- 
leguminous C; crops are less clear. From a study of adult men and women 
in the United States, there is strong evidence that the substitution of die- 
tary carbohydrate for dietary protein increased the risk of hypertension, 
lipid disorders, and 10-year coronary heart disease risk’’. For the devel- 
oping world, minimum protein requirements for different demographic 
groups are an area of active research and debate”®. For countries such as 
India, however, in which up to one-third of the rural population is thought 
to be at risk of not meeting protein requirements”’ and in which most 


protein comes in the form of C; grains”, decreased protein content in non- 
leguminous C; crops may have serious consequences for public health. 

Whereas zinc and iron were significantly decreased in all C3 crops 
tested, only iron in maize was observed to decrease among the C, crops. 
No changes were found in sorghum. That zinc and iron declines were 
notable in C; crops but less so in C, crops is consistent with differences 
in physiology. C, crops concentrate CO, internally, which results in 
photosynthesis being CO2-saturated even under ambient [CO,] con- 
ditions, leading to no stimulation of photosynthetic carbon assimila- 
tion at elevated [CO,] levels under mesic growing conditions”. Our 
finding that protein content was less affected in legumes than in other 
C; crops is also physiologically consistent with the general ability of 
leguminous crops to match the stimulation of photosynthetic carbon 
gain at elevated [CO.] with greater nitrogen fixation, to maintain tissue 
carbon:nitrogen (C:N) ratios”’. In contrast, most temperate non-legume 
C; crops are generally unable to extract and assimilate sufficient nitro- 
gen from soils to maintain tissue C:N ratios**”*. 

Little is known about the mechanism(s) responsible for the decline 
in nutrient concentrations associated with elevated [CO,]. Some authors 
have proposed ‘carbohydrate dilution’, by which CO2-stimulated carbo- 
hydrate production by plants dilutes the rest of the grain components”. 
To test this hypothesis, we measured concentrations of additional ele- 
ments for all crops except wheat (Extended Data Table 4). Our findings 
were inconsistent with carbohydrate dilution operating alone. If only 
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Figure 1 | Percentage change in nutrients at elevated [CO,] relative to 

ambient [CO,]. Numbers in parentheses refer to the number of comparisons 
in which replicates of a particular cultivar grown at a specific site under one set 
of growing conditions in one year at elevated [CO ] have been pooled and for 
which mean nutrient values for these replicates are compared with mean values 
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for identical cultivars under identical growing conditions except grown at 
ambient [CO ]. In most instances, data from four replicates were pooled for 
each value, meaning that eight experiments were combined for each 
comparison (see Table 1 for details of experiments). Error bars represent 
95% confidence intervals of the estimates. 
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Figure 2 | Percentage change (with 95% confidence intervals) in nutrients at elevated [CO] relative to ambient [CO], by cultivar. a, Zinc; b, iron; c, protein. 


passive dilution of nutrients were occurring, we would have expected to 
see very similar changes in the concentration of each nutrient tested for 
a given crop. In contrast, we found that elemental changes in the indi- 
vidual crops are distinct from each other. For example, in rice grains 
(Extended Data Table 4) the decrease in zinc concentrations associated 
with elevated [CO,] was significantly different from the decreases in 
the concentrations of copper (P = 0.001), calcium (P = 0.001), boron 
(P = 0.001) and phosphate (P = 0.010). This heterogeneous response 
was also observed in recent analyses reviewing possible mechanisms 
for nutrient changes in both edible and non-edible plant tissues grown 
at elevated [CO,]’’. It also seems that the mechanism(s) causing these 
changes operate distinctly in different species. In one instance, for example, 
we found boron to be significantly decreased in soybeans (P = 0.001), 
whereas it was significantly elevated in rice grains (P = 0.001). Although 
these differences may, in part, have derived from different envir- 
onmental conditions, they suggest that the mechanism is more complex 
than carbohydrate dilution alone. Of all the elements, changes in nitro- 
gen content at elevated [CO,] have been the most studied, and inhibition 
of photorespiration and malate production”, carbohydrate dilution’®, 
slower uptake of nitrogen in roots” and decreased transpiration-driven 
mass flow of nitrogen’ may all be significant. 

Wealso examined the effects of elevated [CO.] on zinc, iron and pro- 
tein content as a function of cultivar when data were available (Fig. 2). 
Whereas most crops showed negligible differences across cultivars, con- 
centrations of zinc and iron across rice cultivars varied substantially 
(P = 0.04 and P = 0.03, respectively; Fig. 2a, b). Such differences between 


cultivars suggest a basis for breeding rice cultivars whose micronutrient 
levels are less vulnerable to increasing [CO ]. Similar effects may occur 
in other crops, given that the statistical power of many of our other inter- 
cultivar tests was limited by sample size. We note, however, that such 
breeding programmes will not be a panacea for many reasons includ- 
ing the affordability of improved seeds and the numerous criteria used 
by farmers in making planting decisions that include taste, tradition, 
marketability, growing requirements and yield. In addition, as has been 
noted previously, there are likely to be trade-offs with respect to yield 
and other performance characteristics when breeding for increased zinc 
and iron content”. 

The public health implications of global climate change are difficult to 
predict, and we expect many surprises. The finding that raising atmo- 
spheric [CO ] lowers the nutritional value of C3 food crops is one such 
surprise that we can now better predict and prepare for. In addition to 
efforts to limit increases in [CO,], it may be important to develop breed- 
ing programmes designed to decrease the vulnerability of key crops to 
these changes. Nutritional analysis of which human populations are most 
vulnerable to decreased dietary availability of zinc, iron and protein from 
C; crops could help to target response efforts, including breeding de- 
creased sensitivity to elevated [CO,], biofortification, and supplementation. 


METHODS SUMMARY 


We examined the response of nutrient levels to elevated atmospheric [CO,] for the 
edible portions of rice (Oryza sativa, 18 cultivars), wheat (Triticum aestivum, 8 
cultivars), maize (Zea mays, 2 cultivars), soybeans (Glycine max, 7 cultivars), field 
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peas (Pisum sativum, 5 cultivars) and sorghum (Sorghum bicolor, 1 cultivar). The 
six crops were grown under FACE conditions; in all six experiments the elevated 
[CO.] was in the range 546-586 p.p.m. 

In accordance with methods described previously’*”*, the natural logarithm of 
the response ratio (r = response in elevated [CO 2]/response in ambient [CO,]) 
was used as the metric for analyses and is reported as the mean percentage change 
(100 X (r — 1)) at elevated [CO.]. Consistent with these earlier analyses of mul- 
tiple species grown under FACE conditions, the responses of different species, 
cultivars and stress treatments and from different years of the FACE experiments 
were considered to be independent and suited to meta-analytic analysis". 

The meta-analysis was designed to estimate the effect of elevated [CO] on the 
concentration of each nutrient in a particular crop and to determine the signifi- 
cance of this effect relative to a null hypothesis of no change. All tests were conducted 
as two-sided; that is, not specifying which direction the nutrient concentrations were 
expected to change under elevated [CO]. Meta-analysis was conducted with a linear 
mixed model. 

Parameter estimates were obtained by the restricted maximum-likelihood method, 
a standard approach for analysing repeated measurements” that, in our case, were 
of nutrient concentrations at the time of harvest. Results for all analyses are re- 
ported as the best estimate of percentage changes in the concentration of nutrients 
along with the 95% confidence intervals associated with each estimate. Two-tailed 
P values are also reported. 
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METHODS 


We examined the response of nutrient levels to elevated atmospheric [CO,] for the 
edible portions of rice (Oryza sativa, 18 cultivars), wheat (Triticum aestivum, 8 
cultivars), maize (Zea mays, 2 cultivars), soybeans (Glycine max, 7 cultivars), field 
peas (Pisum sativum, 5 cultivars) and sorghum (Sorghum bicolor, 1 cultivar). The 
six crops were grown under FACE conditions; in all six experiments, the elevated 
[CO ] was in the range 546-586 p.p.m. (see the Agricultural Methods section below 
for details associated with individual trials). 

Statistics. In accordance with methods described previously'*”*, the natural log- 
arithm of the response ratio (r = response in elevated [CO]/response in ambient 
[CO ]) was used as the metric for analyses and is reported as the mean percentage 
change (100 X (r — 1)) at elevated [CO]. Consistent with these earlier analyses of 
multiple species grown under FACE conditions, the responses of different species, 
cultivars and stress treatments and from different years of the FACE experiments 
were considered to be independent and suited to meta-analytic analysis’*. 

The meta-analysis was designed to estimate the overall effect of elevated [CO.] 
on the concentration of each nutrient in a particular crop and to determine the 
significance of this effect relative to a null hypothesis of no change. All tests were 
conducted as two-sided—not specifying which direction the nutrient concentrations 
were expected to change under elevated [CO,]—to make the analysis as general as 
possible. Meta-analysis was conducted with a linear mixed model. A random 
intercept was included for each comparison, representing nutrient level variability 
unrelated to [CO] that was common to both treatment groups. Additional ana- 
lyses indicated that the effect of [CO] on zinc concentration in rice was modified 
by cultivar and amount of nitrogen application, suggesting systematic variations 
across the pooled analysis of rice, and for these samples it was shown that the effect 
on zinc concentration was still significant when including interactions terms for 
cultivar and nitrogen. No other significant modifications of the [CO] effect were 
identified. We tested whether changes in different nutrients for particular crops 
were statistically different from each other, as has been described*’. To address the 
issue of multiple comparisons when testing for differences between cultivars 
within a crop, we multiplied the P value by the number of independent compar- 
isons. This approach follows the so-called Bonferroni correction and is conservat- 
ive in the sense of biasing the P values high, but still shows that individual test results 
are significant despite their having been selected from multiple tests. 

Parameter estimates were obtained by the restricted maximum-likelihood method, 
a standard approach for analysing repeated measurement data” that, in our case, 
were of nutrient concentrations at time of harvest. Results for all analyses are 
reported as the best estimate of percentage changes in the concentration of nutri- 
ents along with the 95% confidence intervals associated with each estimate. Two- 
tailed P values are also reported. 

When combining our data with previously published data, we defined outliers 

as pairs in which the difference between an observation at ambient [CO] and 
elevated [CO] was at least three times the standard deviation from the mean 
differences for that crop and nutrient type when calculated using all observations. 
Using this criterion, we excluded a total of two pairs of previously published data 
from all analyses; these included one observation ofiron in rice and one observation 
of zinc in potato. 
Agricultural methods. Rice (Oryza sativa, 18 cultivars), wheat (Triticum aesti- 
vum, 8 cultivars), maize (Zea mays, 2 cultivars), soybeans (Glycine max, 7 culti- 
vars), field peas (Pisum sativum, 4 cultivars) and sorghum (Sorghum bicolor, 1 
cultivar) were grown under FACE conditions during daylight hours. The experi- 
ments were conducted in Australia, Japan and the United States between 1998 and 
2010. Ambient [CO,] ranges were between 363 and 386 p.p.m.; elevated [CO.] was 
between 546 and 584p.p.m. With the exception of soybeans, each experiment 
involveed multiple cultivars of each crop and more than one set of growing con- 
ditions. Each experiment for each cultivar and set of treatments was replicated four 
times, with the exception of one of the rice sites, for which three replicates were 
performed. These data are summarized in Table 1, and additional details of the soil 
and growing conditions, FACE methods and experimental designs have been 
published for rice*’, wheat”, maize”’, soybeans™, field peas” and sorghum”. 
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Minerals method. Samples were analysed for minerals by heated closed-vessel 
digestion/dissolution with nitric acid and hydrogen peroxide followed by quan- 
tification with an inductively coupled plasma atomic emission spectrometer”. 
Nitrogen content was measured by flash combustion of the sample coupled with 
thermal conductivity/infrared detection of the combustion gases (N2, NO, and 
CO ) with a LECO TruSpec CN Analyzer”. Protein values are based on measure- 
ment of nitrogen and conversion to protein with the equation below, where k = 5.36 
(ref. 38): 


protein (weight %) = k X nitrogen (weight %) 


For phytic acid determination, a modified version of the method of ref. 39 was used. 
The accuracy of the method was monitored by the inclusion of tissue standards of 
known and varying levels of phytic acid”. 

Dietary calculations. The United Nations Food and Agriculture Organization 
(UNFAO) publishes annual Food Balance Sheets, which provide country-specific 
data on the quantities of 95 ‘standardized’ food commodities available for human 
consumption. Data, expressed in terms of dietary energy (kilocalories per person 
per day) were downloaded for 210 countries and territories with available informa- 
tion for the period 2003-2007 (available at http://faostat.fao.org). The percentage of 
dietary energy available from C3 grasses (wheat, barley, rye, oats, rice and ‘cereals, 
other’ (excluding Eragrostis tef)) was calculated globally with estimates weighted by 
national population size (188 countries available; UN 2011; 2012 revision available 
at http://esa.un.org/wpp/). 

Dietary intake data from the UNFAO Food Balance Sheets (to year 2000) and 
food composition data from the United States Department of Agriculture National 
Nutrient Database for Standard Reference were used to calculate per-person nutri- 
ent intake for 95 food items; these were shared with us with permission*!. This data 
set was used to calculate the contribution of each food item to total dietary zinc and 
iron intake, and the proportions of all food items derived from C; grains and 
legumes were summed to identify countries that are highly dependent on plant 
sources of iron and zinc (Extended Data Table 5). 
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Extended Data Table 1 | Percentage change in nutrient content at elevated [CO.] relative to ambient [CO2] 


N* Zn (ug/g) Fe (ug/g) Protein (mg/g) Phytate (g/100g) 
(number of 
pairs) % 95% CI P-value 95% CI P-value 95% CI P-value 95% CI P-value 
C3 grasses 
Wheat 64 -9.3 (-12.7,-5.9)  <.0001 (-6.5,-3.7) P (-7.5,-5.2) <.0001 (-7.5,-0.8) 
Rice 31 -3.3 (-5.0,-1.7) <.0001 ; (-7.6,-2.9) : (-8.9,-6.8) <.0001 (-4.6,7.4) 
C3 legumes 
Field peas 10 -6.8 (-9.8,-3.8) 0.002 ; (-6.7,-1.4) : (-4.0,-0.1) 0.039 (-11.5,0.1) 
Soybeans 25 -5.1 (-6.4,-3.9) <.0001 (-5.8,-2.5) (-0.4,1.3) 0.267 (-3.7,1.2) 
C4 grasses 
Maize 4 -5.2 (-10.7,0.6) 0.077 (-10.9,-0.3) : (-13.0,4.5) 0.312 -6.1 (-15.0,3.7) 0.215 
Sorghum 4 -1.3 (-6.2,3.8) 0.603 (-5.8,9.7) (-4.9,5.2) 0.993 12.8 (-15.8,51.1) 0.418 


*‘Number of pairs’ refers to the number of comparisons in which replicates of a particular cultivar grown at a specific site under one set of growing conditions in one year at elevated [CO2] have been pooled and 
mean nutrient values for these replicates were compared with mean values for identical cultivars under identical growing conditions except grown at ambient [CO2]. In most instances, data from four replicates 
were pooled for each value, meaning that eight experiments were combined for each comparison (see Table 1 for details of experiments). 
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Extended Data Table 2 | Original data combined with previously published FACE data from studies 3, 4, 6 and 7 


Field peas 
Soybeans 


C, tubers 
Potato 


Sorghum 


% 


Zn (\1g/g) 


95% CI 


(-11.9,-5.6) 
(-4.8,-1.5) 
(-19.3,-2.7) 


(-9.8,-3.8) 
(-6.4,-3.9) 


(-12.9,6.2) 


(-10.7,0.6) 
(-6.2,3.8) 


P-value 


-5.5 
-4.9 
-10.5 


-4.1 
-4.1 


2.3 


-5.8 
1.6 


Fe (ug/g) 


95% CI 


(-6.8,-4.1) 
(-7.3,-2.6) 
(-12.2,-8.7) 


(-6.7,-1.4) 
(-5.8,-2.5) 


(-3.8,8.7) 


(-10.9,-0.3) 
(-5.8,9.7) 


P-value 


<.0001 
<.0001 
<.0001 


0.003 
<.0001 


See Extended Data Table 6 for a list of experiments. Percentage change in nutrient content at elevated [COz] relative to ambient [COz]. 
*‘Number of pairs’ refers to the number of comparisons in which replicates of a particular cultivar grown at a specific site under one set of growing conditions in one year at elevated [CO2] have been pooled and 
mean nutrient values for these replicates were compared with mean values for identical cultivars under identical growing conditions except grown at ambient [COz]. In most instances, data from four replicates 
were pooled for each value, meaning that eight experiments were combined for each comparison (see Table 1 for details of experiments). 
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-6.5 
-8 
-11.9 


Protein (mg/g) 


95% CI P-value 


(-7.5,-5.4) 
(-9.0,-6.9) 
(-13.1,-10.7 


(-4.0,-0.1) 
(-0.4,1.3) 


(-7.7,-1.4) 


(-13.0,4.5) 0.312 
(-4.9,5.2) 0.993 
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Extended Data Table 3 


(number of 


C3 grasses 
Wheat 
Rice 
Barley 


C3 legumes 
Field peas 
Soybeans 


C, tubers 
Potato 


C4 grasses 
Maize 
Sorghum 


pairs) 


Zn (ug/g) 
95% CI 


(-12.1,-6.1) 
(-4.8,-1.5) 
(-19.3,-7.6) 


(-9.8,-3.8) 
(-6.1,-3.9) 


(-20.9,2.4) 


(-10.7,0.6) 
(-4.5,3.4) 


P-value 


95% CI P-value 


<.0001 
<.0001 
<.0001 


5.9  (-7.8,-4.0) 
49  (-7.3,-2.6) 
-10.0  (-12.4,-7.4) 


(-6.7,-1.4) 
(-7.9,-2.5) 


(-16.6,10.3) 


(-10.9,-0.3) 
(-10.2,99.3) 


See Extended Data Table 6 for a list of experiments. Percentage change in nutrient content at elevated [COz] relative to ambient [CO2]. 
*‘Number of pairs’ refers to the number of comparisons in which replicates of a particular cultivar grown at a specific site under one set of growing conditions in one year at elevated [CO2] have been pooled and 
mean nutrient values for these replicates were compared with mean values for identical cultivars under identical growing conditions except grown at ambient [CO3]. In most instances, data from four replicates 
were pooled for each value, meaning that eight experiments were combined for each comparison (see Table 1 for details of experiments). 
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Original data combined with previously published FACE and chamber data from studies 1-10 


Fe (g/g) Protein (mg/g) 


95% CI P-value 


7.2 (-8.6,-5.8) 
-8  (-9.0,-6.9) 
“15.0 (-19.1,-10.7 


21 (-4.0,-0.1) 
0.1 (-0.8,0.9) 


97 15.9,-3.1) 


-4.6  (-13.0,4.5) 
5.6 (-12.7,2.1) 
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Extended Data Table 4 | Percentage change in nutrient content at elevated [CO2] compared with ambient [CO,] for all nutrients 


% 95% CI P-value % 95% CI P-value % 95% Cl P-value Ge 95% Cl P-value % 95% CI P-value Go 95% CL P-value 


Zinc (ppm) (-12.7,-5.9) <.0001 5. (-5.0,-1.7) (-9.8,-3.8) <.0001 5. 4,3: <.0001 3 (-10.7,0.6) 0.077 (6.2.3.8) 0.603 
Iron (ppm) (-6.5,-3.7) <.0001 . (-7.6,-2.9) ; (-6.7,-1.4) <.0001 I 2.5 <.0001 . (-10.9,-0.3) 0.038 x (-5.8,9.7) 0.674 
Phytate (mg/g) (-7.5,-0.8) 0.009 F (-4.6,7.4) fs (-1L.5,0.1) 0.055 3 (-3.7,1.2) 0.303 mI (-15.0,3.7) (-15.8,51.1) 0.418 
Protein (-7.5,-5.2) <.0001 x (-8.9,-6.8) 5 (-4.0,-0.1) 0.039 .5 (0.4.1.3) 0.267 (-13.0,4.5) 3 (4.9.5.2) 0.993 
Mn (ppm) f (-12.0,-2.8) 5 (-4.2,-0.8) 0.005 : (3.5.0.8) 0.204 x 215 ! (-4.5,8.3) 0,596 
Mg (%) x -3,0.6) . 0.960 3.5 3 <.0001 1.3 0.011 - ‘ 0.944 
Cu (ppm) . 0.025 (-8.0,-3.4 <.0001 x 0.066 A 0.190 
Ca (%) - 0.787 5. <.0001 9,13 0.734 2 5 3 0.190 


S (ppm) i . 3.6, 0.003 <.0001 i! 0.342 -0.2 0.936 
K (%) ¢ 3,2. (0.6,3.8) 0.008 . 0.857 I (-3.1,-2.2) <.0001 3 f 0.308 
B (ppm) 5 (1.9,8.4) m (3.9.0.1) 0.057 . (-9.1,-3.6) <.0001 R (-L.0,11.1) 0.107 -0.3 (-9.3,9.6) 0.952 
P(%) (-2.4,0.4) i (-6.8,-0.5) 0.023 (2.2.0.9) 0.379 i (-9.0,-5.1) <.0001 .3 4.9) 0.881 


Sample sizes for each crop type are identical to those listed in Table 1. 
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Extended Data Table 5 | Countries whose populations receive at least 60% of dietary iron and/or zinc from C3 grains and legumes 


Gout % Tron from C; % Zinc from C; Population 
ountry 
grains & legumes grains & legumes (in thousands) 


Afghanistan 78% 78% 31,412 
Algeria 716% 719% 35,468 
Iraq 74% 83% 31,672 
Bangladesh 72% 88% 148,692 
Iran, Islamic Rep of 72% 771% 73,974 
Pakistan 70% 72% 173,593 
Tunisia 70% 771% 10,481 
Jordan 69% 73% 6,187 
Morocco 69% 78% 31,951 
Syrian Arab Republic 67% 11% 20,411 
Libya 67% 11% 6,355 
Yemen 66% 715% 24,053 
Myanmar 65% 81% 47,963 
Tajikistan 62% 56% 6,879 
India 59% 11% 1,224,614 
Egypt 54% 65% 81,121 
Indonesia 52% 65% 239,871 
Sierra Leone 51% 710% 5,868 
Cambodia 49% 68% 14,138 
Sri Lanka 46% 69% 20,860 
Laos 44% 66% 6,201 
Viet Nam 43% 61% 87,848 
Total 2,329,612 


Source: United Nations Food and Agriculture Organization food balance sheets and 2010 United Nations estimated population. 
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Extended Data Table 6 | Literature reporting nutrient changes in the edible portion of crops grown at elevated and ambient [CO2] 


1]Growth Chambers Conroy , J., Seneweera, S. P., Basra, A., Rogers, G. & Nissen-Wooller, B. Influence of rising atmospheric CO, concentrations and temperature on growth, 
yield and grain quality of cereal crops. Australian Journal of Plant Physiology 21, 741-758 (1994). 


Seneweera , S., Milham, P. & Conroy, J. Influence of elevated CO, and phosphorus nutrition on the growth and yield of a short-duration rice. Australian 
Journal of Plant Physiology 21, 281-292 (1994). 


Seneweera, S. P. & Conroy, J. P. Growth, grain yield and quality of rice (Oryza sativa L.) in response to elevated CO, and phosphorus nutrition (Reprinted 
from Plant nutrition for sustainable food production and environment, 1997). Soil Sci. Plant Nutr. 43, 1131-1136 (1997). 


2) Temperature Gradient Tunnels De la Puente, L. S., Perez, P. P., Martinez-Carrasco, R., Morcuende, R. M. & Del Molino, |. M. M. Action of elevated CO, and high temperatures on the 
mineral chemical composition of two varieties of wheat. Agrochimica 44, 221-230 (2000). 

3]Open Top Chambers & FACE De Temmerman L et al. Effect of climatic conditions on tuber yield (Solanum tuberosum L.) in the European ‘CHIP’ experiments. European Journal of 
Agronomy 17, 243-255 (2002). 


De Temmerman, L., Hacour, A. & Guns, M. Changing climate and potential impacts on potato yields and quality ‘CHIP’: introduction, aims and 
methodology. European Journal of Agronomy 17, 233-242 (2002). 


Fangmeier, A., De Temmerman, L., Black, C., Persson, K. & Vorne, V. Effects of elevated CO, and/or ozone on nutrient concentrations and nutrient uptake 
of potatoes. European Journal of Agronomy 17, 353-368 (2002). 


Hégy, P. & Fangmeier, A. Atmospheric CO, enrichment affects potatoes: 2. Tuber quality traits. European Journal of Agronomy 30, 85-94 (2009). 
Erbs, M. et al. Effects of free-air CO, enrichment and nitrogen supply on grain quality parameters and elemental composition of wheat and barley grown in 
a crop rotation. Agriculture, Ecosystems and Environment 136, 59-68 (2010). 


Fangmeier, A. et al. Effects of elevated CO., nitrogen supply and tropospheric ozone on spring wheat. |. Growth and yield. Environmental Pollution 91, 381 
390 (1996). 


Fangmeier, A., Griters, U., Hégy, P., Vermehren, B. & Jager, H.-J. Effects of elevated COz, nitrogen supply and tropospheric ozone on spring wheat — II. 
Nutrients (N, P, K, S, Ca, Mg, Fe, Mn, Zn). Environmental Pollution 96, 43-59 (1997). 


Fangmeier, A. et al. Effects on nutrients and on grain quality in spring wheat crops grown under elevated CO; concentrations and stress conditions in the 
European, multiple-site experiment 'ESPACE-wheat'. European Journal of Agronomy 10, 215-229 (1999). 


Jager , H.-J., Hertstein, U. & Fangmeier, A. The European Stress Physiology and Climate Experiment — project 1: wheat (ESPACE-wheat): introduction, 
aims and methodology. European Journal of Agronomy 10, 155-162 (1999). 


Hégy, P. & Fangmeier, A. Effects of elevated atmospheric CO, on grain quality of wheat. Journal of Cereal Science 48, 580-591 (2008). 
Hégy, P. et al. Does elevated atmospheric CO, allow for sufficient wheat grain quality in the future? . Journal of Applied Botany and Food Quality 82, 114- 
121 (2009). 


Hégy, P. et al. Effects of elevated CO, on grain yield and quality of wheat: results from a 3-year free-air CO2 enrichment experiment. Plant Biology 11, 60- 
69 (2009). 


Hégy, P., Zérb, C., Langenkamper, G., Betsche, T. & Fangmeier, A. Atmospheric CO, enrichment changes the wheat grain proteome. Journal of Cereal 
Science 50, 248-254 (2009). 


Kim , H., Lieffering, M., Miura, S., Kobayashi, K. & Okada, M. Growth and nitrogen uptake of CO,-enriched rice under field conditions. New Phytologist 
150, 223-229 (2001). 


Kim , H. et al. Effects of free-air CO, enrichment and nitrogen supply on the yield of temperate paddy rice crops. Field Crops Research 83, 261-270 
(2003). 


Lieffering, M., Kim, H.-Y., Kobayashi, K. & Okada, M. The impact of elevated CO, on the elemental concentrations of field-grown rice grains. Field Crops 
Research 88, 279-286 (2004). 


Si Open Top Chambers Pleijel, H. et al. Effects of elevated carbon dioxide, ozone and water availability on spring wheat growth and yield. Physiologia Plantarum 108, 61-70 
(2000). 
Pleijel, H. & Danielsson, H. Yield dilution of grain Zn in wheat grown in open-top chamber experiments with elevated CO, and O3 exposure. Journal of 
Cereal Science 50, 278-282 (2009). 
(Open Top Chambers Prior, S. A., Runion, G. B., Rogers, H. H., Torbert, H. A. Effects of atmospheric CO, enrichment on crop nutrient dynamics under no-till conditions. Journal 
of Plant Nutrition 31, 758-773 (2008). 
10{Open Top Chambers Weigel, H., Manderscheid, R., Jager, H.-J. & Mejer, G. Effects of season-long CO, enrichment on cereals. |. Growth performance and yield. Agriculture, 
Ecosystems and Environment 48, 231-240 (1994). 
Manderscheid, R., Bender, J., Jager, H., J & Weigel, H., J. Effects of season long CO, enrichment on cereals. II. Nutrient concentrations and grain quality. 
Agriculture, Ecosystems & Environment 54, 175-185 (1995). 
TIPFACE Yang, L., Wang, Y., Dong, G., Gu, H., Huang, J., Zhu, J., Yang, H., Liu, G., Han, Y. The impact of free-air CO, enrichment (FACE) and nitrogen supply on 
grain quality of rice. Field Crops Research 102, 128-140 (2007). 
Meta-Analyses Loladze, |. Rising atmospheric CO, and human nutrition: toward globally imbalanced plant stoichiometry? Trends in Ecology and Evolution 17 (10), 457- 
461 (2002). [Uses data from studies 1, 2, 5, and 10 as well as numerous other studies on non-edible tissues and plants other than food crops]. 
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Coordination of entorhinal-hippocampal ensemble 
activity during associative learning 


Kei M. Igarashi’, Li Lu', Laura L. Colgin?, May-Britt Moser’ & Edvard I. Moser! 


Accumulating evidence points to cortical oscillations as a mecha- 
nism for mediating interactions among functionally specialized neu- 
rons in distributed brain circuits’’*. A brain function that may use 
such interactions is declarative memory—that is, memory that can 
be consciously recalled, such as episodes and facts. Declarative mem- 
ory is enabled by circuits in the entorhinal cortex that interface the 
hippocampus with the neocortex’*. During encoding and retrieval 
of declarative memories, entorhinal and hippocampal circuits are 
thought to interact via theta and gamma oscillations*®**, which in 
awake rodents predominate frequency spectra in both regions”. 
In favour of this idea, theta-gamma coupling has been observed 
between entorhinal cortex and hippocampus under steady-state con- 
ditions in well-trained rats'’; however, the relationship between inter- 
regional coupling and memory formation remains poorly understood. 
Here we show, by multisite recording at successive stages of asso- 
ciative learning, that the coherence of firing patterns in directly con- 
nected entorhinal-hippocampus circuits evolves as rats learn to use 
an odour cue to guide navigational behaviour, and that such coher- 
ence is invariably linked to the development of ensemble representa- 
tions for unique trial outcomes in each area. Entorhinal-hippocampal 
coupling was observed specifically in the 20-40-hertz frequency band 
and specifically between the distal part of hippocampal area CA1 and 
the lateral part of entorhinal cortex, the subfields that receive the pre- 
dominant olfactory input to the hippocampal region’’. Collectively, 
the results identify 20-40-hertz oscillations as a mechanism for syn- 
chronizing evolving representations in dispersed neural circuits dur- 
ing encoding and retrieval of olfactory-spatial associative memory. 

Neural activity was recorded from entorhinal cortex (EC) and CA1 
of 17 rats trained to solve a simplified version of an odour-place asso- 
ciation task thought to depend on interfacing of the hippocampus with 
inputs from the olfactory bulb and the piriform cortex via the EC’ 
(Fig. 1 and Supplementary Video 1). On each trial, the animal sampled 
odours ina cue port for 1 s and then, depending on odour identity, ran 
to either of two cups for food reward (Fig. 1b). Following 3 weeks of 
training, the percentage of correct trials increased to asymptotic levels, 
reaching a criterion of two consecutive sessions of 85% correct perfor- 
mance after 16.8 + 0.8 days (mean + s.e.m.; Fig. 1c; repeated measures 
analysis of variance (ANOVA): F(20, 320) = 95.6, P< 0.001). Training- 
day numbers 2, 6, 10, 14 and 18 were defined as T1-T5, respectively. 
T5 was post-criterion for all animals. 

We first examined spectral activity in CA1 of five well-trained ani- 
mals. After these animals had reached the 85% performance criterion, 
electrodes were implanted across the transverse axis of CA1 (Fig. la 
and Extended Data Fig. 1a). Analyses focused on activity during the cue- 
sampling period, when recall of odour-place associations was expected 
to be initiated (Fig. 1d and Extended Data Fig. 2). Cue sampling was 
associated with strong oscillatory activity in the local field potential (LFP) 
of CA1 (Fig. le). The activity change was strongest in the 20-40-Hz 
frequency range, below the gamma-band frequencies that dominate run- 
ning behaviour'®’*"* and in agreement with previous observations in 
olfaction-based discrimination tasks’*. Some increase was observed at 


~15-20 Hz, but a 20-Hz cut-off was used to avoid confounds with theta 
harmonics (~ 14-18 Hz; Fig. le and Extended Data Fig. 3). The power 
of the 20-40-Hz oscillation increased gradually during the cue samp- 
ling interval, peaked during the second half, and dropped back as soon 
as the animal left the port (Fig. le, f, and Extended Data Figs 3 and 4). 
The magnitude of this oscillation increased from proximal to distal 
CA1 (dCA1; Extended Data Fig. 5), consistent with the idea that beta- 
frequency olfactory inputs mediated by the lateral part of the EC (LEC) 
preferentially reach dCA1"*”*. 

We next asked whether 20-40-Hz oscillations in dCA1 are accom- 
panied by similar LFP activity in EC. In 9 rats, we recorded simulta- 
neously from dCA1 and layer III of either lateral or medial EC (LEC: 
n = 20 tetrode pairs in 5 rats; MEC: n = 16 tetrode pairs in 4 rats; Ex- 
tended Data Fig. la-c). In all animals that reached the 85% criterion, 
LEC exhibited strong 20-40-Hz oscillations during cue sampling (Fig. 1g 
and Extended Data Fig. 3). These oscillations were highly coherent with 
20-40-Hz oscillations in dCA1 (Fig. 1h, i). Both power and coherence 
reached peak values during the second half of the cue interval (Fig. 1g, h). 
Coherence was increased only rarely during the inter-cue period (Fig. 1f, 
j,k). The changes were restricted to LEC. No increase in 20-40-Hz oscil- 
lations was detected in MEC (Fig. 1g), and oscillations in dCA1 showed 
stronger coupling with LEC than MEC at all 0.97-Hz frequency bins 
between 15.6 and 39.0 Hz but not at any lower or higher frequencies 
(false discovery rate (FDR) corrected for 62 multiple comparisons at 
0-60 Hz, q < 0.05; Fig. 1h, i). Instead, and in agreement with previous 
results!”, the MEC electrodes showed fast gamma oscillations that coupled 
moderately with dCA1 during the running segments of the task (peak 
frequency at 72.1 and 61.0 Hz for power and coherence, respectively; 
Extended Data Fig. 3). 

The observed 20-40-Hz coherence between LEC and CA1 could reflect 
disynaptic LEC-CA3 and CA3-CA1 coupling”. If this were the case, 
LEC-CA1 coherence would be expressed in both proximal CA1 (pCA1) 
and dCA1, considering that LEC input terminates indiscriminately along 
the CA3 transverse axis'*. However, simultaneous recordings from LEC 
and pCA1 showed no increase in coherence during the cue interval (3 rats; 
Fig. 1h). At all frequencies from 13.7 to 33.2 Hz, pCA1-LEC coherence 
was weaker than dCA1-LEC coherence in the first group of rats (FDR 
corrected, q < 0.05; 12 tetrode pairs; Fig. 1i). The lack of coupling bet- 
ween LEC and pCA1, and between MEC and dCA1, suggests that the 
coherence is mediated by direct LEC-CA1 connections. 

We then asked whether and how oscillatory coupling between LEC 
and dCA1 evolves as the animals learn to use the odour cues to navi- 
gate. Tetrodes were targeted simultaneously to dCA1 and layer III of 
LEC in 5 rats. Once the electrodes were in place, the rats were trained to 
associate different odours with different reward locations, and LFP as 
well as individual cell activity were recorded daily from each region, 
from T1 to T5. Learning was accompanied by a steady increase in 20- 
40-Hz coherence between LEC and dCA1 (Fig. 2a—c and Extended Data 
Fig. 6g; repeated measures ANOVA for all 20 combinations of record- 
ing pairs: F(4, 76) = 35.7, P< 0.001; one recording pair per animal: 
F(4, 16) = 17.9, P< 0.001). At T5, LEC-dCA1 coherence was stronger 
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Figure 1 | 20-40-Hz coupling between LEC and dCA1 during successful 
odour discrimination. a, Box diagram showing recording locations and their 
connections. b, Task chamber. c, Correct choice rate (mean + s.e.m.) as a 
function of training day (all animals). d-k, Data from T5. d, Mean 

speed + s.e.m. (shading) before, during and after odour cue. e, Top, example 
trace of local field potential (LFP) from dCA1. Orange bar, cue period. Grey 
shadings indicate 20-40-Hz oscillations with power >2s.d. above the mean. 
Bottom, time-resolved power spectrum averaged across dCA1 tetrodes 

(all animals). LFP power was normalized to pre-cue power and aligned to cue 
onset (t = 0). Note increase in 20-40-Hz oscillation during cue sampling 
(t(8) = 3.3, P = 0.01, paired t-test). f, Map of task chamber showing spatial 
distribution of 20-40-Hz power change between cue sampling and food 
retrieval (left) and between food retrieval and cue sampling (middle). Right, 
spatial distribution of 20-40-Hz LEC-dCA1 coherence (cue sampling to food 
retrieval). g, Time-resolved power spectra averaged across all tetrodes of all 
rats in LEC (t(9) = 4.8, P< 0.001) and MEC (¢(7) = 1.3, P= 0.24). 


than at Tl for each 0.97-Hz frequency bin between 15.6 and 43.0 Hz 
(Fig. 2b, c; FDR corrected for 62 multiple comparisons at 0-60 Hz, q << 
0.05). Coherence in the theta-band showed no change (Fig. 2c; F(4, 76) 
= 0.79, P = 0.54). The development of interregional coupling was accom- 
panied by gradual cross-frequency coupling between theta and 20- 
40-Hz oscillations in dCA1 (Fig. 2d-g; repeated measures ANOVA for 
mean vector length of 20-40-Hz event distribution across theta phase: 
F(4, 36) = 36.6, P< 0.001), supporting earlier observations in CA3 in an 
item-context association task’”. In LEC, by contrast, 20-40-Hz and theta 
oscillations were phase-coupled from the start of training (Extended Data 
Fig. 7). In both regions, the power of the 20-40-Hz oscillations remained 
unaltered (Extended Data Fig. 6a—d), suggesting that learning is primarily 
associated with increased coupling of pre-existing 20-40-Hz rhythms. 
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h, Time-resolved coherence spectra averaged over all recording pairs in 
LEC-dCA1, MEC-dCA1 and LEC-pCA1. Odour-evoked 20-40-Hz coupling 
is apparent in LEC-dCA1 (cue versus pre-cue: t(19) = 4.4, P< 0.001) but not 
in MEC-dCA1 or LEC-pCA1. i, Mean coherence = s.e.m. (shading) as a 
function of frequency for all LEC-dCA1, MEC-dCA1 and LEC-pCA1 
recording pairs in all animals. Green and blue dots at the top indicate 
frequencies with significant LEC-dCA1 versus MEC-dCA1 and LEC-dCA1 
versus LEC-pCA1 differences, respectively (q < 0.05, FDR corrected, all 
tetrode pairs). j, Percentage of high 20-40-Hz LEC-dCA1 coherence as a 
function of trial time (mean + s.e.m. of all recording pairs). 100-ms blocks with 
coherence exceeding 2s.d. of coherence across the entire session were defined as 
high-coherence epochs. Pale coloured lines, individual rats. k, Percentage of 
high-coherence epochs during cue sampling (cue), running (run1, to food cup; 
run2, back to cue port), and at food cup (food) (repeated-measures ANOVA: 
F(3, 57) = 36.0, P< 0.001; *P < 0.001, Bonferroni post-hoc test). 


To determine whether coupling of 20-40-Hz oscillators in LEC and 
dCA1 is necessary for successful performance, we assessed activity at 
T5 onerror trials (“T5e’; Supplementary Video 1). Results were compared 
to an identical number of correct trials from the same session (down- 
sampled correct trials, ‘T5d’). The coherence of LEC and dCA1 oscillations 
in the 20-40-Hz band decreased significantly on error trials (Fig. 2a—c; 
paired t-test for all combinations of recording pairs, t(19) = 9.81, P< 
0.001; for one pair per animal: t(4) = 5.0, P = 0.007). There was also 
a decrease in cross-frequency coupling in dCA1 (Fig. 2f, g; t(9) = 6.30, 
P<0.001). These findings suggest that 20-40-Hz coupling of dCA1 
and LEC is necessary for successful odour-based navigation. 

The emergence of 20-40-Hz coupling was reflected in individual dCA1 
and LEC cells. A large proportion of the cue-port cells’* in dCA1 were 
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Figure 2 | Evolution of 20-40-Hz coupling between LEC and dCA1 during 
odour-place learning. a—c, Coherence between LEC and dCA1 evolves in 
parallel with learning (average of 20 recording pairs in 5 rats). a, Time-resolved 
LEC-dCA1 coherence spectra. b, Mean coherence ~ s.e.m. as a function of 
frequency. Top, blue and grey dots indicate frequencies at which T1-T5 and 
T5d-T5e differences, respectively, were significant (q < 0.05, FDR corrected). 
c, Coherence for 20-40-Hz (top) and theta frequencies (bottom) from T1 to T5 
and on TS error trials (T5e). T5d, correct T5 trials after down-sampling to 
same number of trials as T5e. Left, mean coherence + s.e.m. (*P < 0.05, 


place cells (Extended Data Fig. 8). The proportion of cue-port cells with 
spike activity significantly phase-locked to local 20-40-Hz oscillations 
increased from 19% to 42% in dCA1, and from 22% to 65% in LEC (T1 
and T5, respectively; Extended Data Fig. 9; binomial tests with Bon- 
ferroni correction, P < 0.005). In dCA1, the phase-locking was reduced 
on error trials (T5e versus T5d: P = 0.03, LEC: P = 0.097). There was 
also an increase from T1 to T5 in cross-regional phase locking (from 
7.9% to 17.4% for dCAI cells versus 20-40-Hz LEC oscillations; from 
13.3% to 24.4% for LEC cells versus dCA1 oscillations; both P < 0.005; 
Extended Data Fig. 9). The match between oscillatory coupling in LFP 
and individual neurons suggests that the change occurred locally in LEC 
and dCA1. 

To establish whether oscillatory coupling evolved as a reflection of 
changes in the animals’ motor behaviour, we monitored the path of the 
rats. There was no change in position and head direction before, during 
or after cue sampling (Extended Data Fig. 2a-f). Power and coherence 
in dCA1 and LEC, as well as 20-40-Hz-theta cross-frequency coupling, 
were significantly larger during the cue interval than during immobility 
at other times (Extended Data Fig. 4), suggesting that the 20-40-Hz 
activity at the cue port is independent of relationships between speed 
and gamma frequency”. 

Could the coupling reflect other non-specific behavioural changes, 
such as enhanced attention to the odour stimuli, which might be accom- 
panied by increased power and coherence of beta-gamma oscillations”””"? 
To dissociate memory from attention, we conducted two control tasks, 
each designed to maintain attention but disrupt memory-based per- 
formance. In both tasks, attention was operationalized by monitoring 
the sniffing frequency of the rat during the cue interval at T5 (Extended 
Data Fig. 2 g-i). In the first test, we added, for each animal, a set of non- 
cued trials (T5n) in which only clean air was delivered at the odour port. 
The increase in 20-40-Hz power was blocked on these trials (Fig. 3a—c; 
power: dCA1: t(8) = 3.50, P = 0.008; LEC: t(9) = 8.40, P< 0.001), as 
was the increase in coherence (reduced compared to T5 at all frequen- 
cies between 16.6 and 34.2 Hz). Sniffing was maintained (Extended Data 
Fig. 2n-o). In the second test, we trained 4 of the animals with two 
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Bonferroni post-hoc test; *P<0.05, paired t-test); right, individual animals 
(colour-coded). d, Example cross-frequency coherence plot showing that 
20--40-Hz oscillatory power (y-axis) was modulated by theta phase (x-axis) in 
dCA1 at T5. Coupling strength is colour-coded. e, Top, time-resolved power 
spectrum averaged across all theta cycles with 20-40-Hz oscillations 

(t = 0, theta trough). Bottom, averaged unfiltered theta cycle. f, Theta 

phase distribution of 20-40-Hz oscillation maxima for LFP from dCA1 
(mean + s.e.m.; 0°, trough). g, Mean vector length of distributions in f (T1 to 
T5 and T5e). *P and *P as inc. 


additional odour pairs (C/D and E/F). After each switch of odour pairs, 
performance dropped temporarily, and then reverted back to criterion 
within three days (Fig. 3d). This development was paralleled by a sig- 
nificant drop in 20-40-Hz coupling between LEC and dCA1 on day 1 
(paired t-test for T5 versus day 1: t(15) = 2.19, P = 0.04) and complete 
recovery by day 3 (F(2, 30) = 4.05, P = 0.03 for C/D and F(2, 30) = 4.54, 
P = 0.02 for E/F) (Fig. 3e and Extended Data Fig. 6e, f). There was no 
change in sniffing frequency during the change of odour pairs (t(3) < 0.5, 
P > 0.6; Extended Data Fig. 2p). These two series of observations sug- 
gest that 20-40-Hz coupling was not primarily a reflection of attention. 

In a final set of analyses, we examined if the changes in spike timing 
were accompanied by changes in the subset of cells recruited by each 
odour. In both dCA1 and LEC, a substantial fraction of the cells de- 
veloped selectivity for one of the odours (Fig. 4a—c). ‘Selectivity’ was 
expressed for each cell as the difference in firing rate during each odour 
divided by the sum of these rates. On individual trials, average select- 
ivity increased during the cue period, mirroring the development of 
20-40-Hz oscillations (Fig. 4a). The development of unique represen- 
tations for left and right odours was confirmed by population vector 
analysis (Fig. 4b, c) as well as receiver-operating characteristic (ROC) 
analysis (Extended Data Fig. 10). Significant changes in selectivity were 
not observed in pCA1 or MEC (data not shown). The selectivity of 
the dCA1 and LEC representations was nearly abolished on error trials 
and non-cued trials (Fig. 4a—c and Extended Data Fig. 10). When anew 
odour pair was introduced, the representations became non-selective 
but reverted to fully discriminative levels by day 3, in parallel with cor- 
rect behavioural choices (Fig. 4f), suggesting that expression of odour- 
specific representations in dCA1 and LEC is necessary for successful 
use of odour cues to guide navigation. 

The development of odour-selective representations in dCA1 and 
LEC was strongly correlated with 20-40-Hz coupling between the two 
regions (Fig. 4d; T1-T5 X 5 animals, all r > 0.48, P< 0.05). LEC selec- 
tivity, 20-40-Hz coupling and task performance developed in parallel 
whereas selectivity in dCA1 lagged behind (Fig. 4e). A repeated mea- 
sures ANOVA on normalized values for each variable, with T1 set to 0 
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Figure 3 | 20-40-Hz coupling on non-cued trials and novel-odour trials. 
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connected cell populations of LEC and dCA1, and that, in each of these 
areas, such coupling coincides with the formation of unique odour 
representations. Error trials and change of odour contingencies were 
invariably accompanied by reduced coupling and reduced ensemble 
selectivity, suggesting that the changes are necessary for successful retriev- 
al or retrieval-based navigation. The results identify 20-40-Hz coupling 
as a potential mechanism for the evolution of functional circuits during 
encoding of associative memory in entorhinal-hippocampal systems. 
Coherent firing between LEC and CA1, timed by 20-40-Hz oscilla- 
tions, may bea prerequisite for the formation of distinct representations 
for differentially associated odour cues, both because coupling pro- 
vides sufficient coincidence of pre- and postsynaptic activity for synaptic 
strengthening to take place’”’ and because coincident firing among affer- 
ent neurons facilitates such strengthening™. Interregional coupling may 
facilitate the pattern-completion processes required for successful reac- 
tivation of newly formed entorhinal and hippocampal cell ensembles. 

20-40-Hz oscillations occur widely across the cortex”””*. During ol- 
factory learning, 20-40-Hz oscillations may coordinate activity across 
distributed areas spanning from the olfactory bulb to the LEC and the 
hippocampus. Increases in 20-40-Hz rhythms in the olfactory bulb and 
olfactory cortex have been observed after olfactory learning’*”®, in par- 
allel with 20-40-Hz field oscillations in the hippocampus"*. Widespread 
20-40-Hz oscillations have also been observed to emerge at late stages 
of habit learning in the ventromedial striatum”. Because 20-40-Hz oscil- 
lations can maintain synchrony with larger conduction delays than faster 
gamma oscillations*”*, they are particularly suitable for linking activity 
across widely distributed brain regions such as those participating in 
olfaction-based memory formation. The strong coincidence of entorh- 
inal-hippocampal 20-40-Hz coupling and the emergence of olfactory 
neural representations in LEC and dCA1, in parallel with behavioural 
learning, points to 20-40-Hz oscillations as a key ingredient of the mech- 
anism for induction and expression of long-term memory in distrib- 
uted cortical circuits. 


METHODS SUMMARY 


Seventeen male Long-Evans rats were trained to perform an odour-place asso- 
ciation task. The rats sampled odours for 1 s in an odour port, after which food was 
delivered in one of two food cups whenever the rat chose the food cup associated 
with the presented odour. The food cups were placed at the other end of the test 
chamber. Behavioural testing was controlled by custom software triggering the valves 
of the olfactometer as well as food-delivery. 

The rats were implanted either with a ‘hyperdrive’ carrying 14 tetrodes targeted 
to cover almost the entire proximodistal extent of dorsal CA1 (n = 5 rats), or with 
two ‘microdrives’ carrying 4 tetrodes targeted, respectively, to the distal part of CA1 
and either the anterodorsal part of LEC (5 rats) or the dorsal part of MEC (4 rats). 
In three rats, microdrives were targeted simultaneously to LEC and the proximal 
part of CA1. A multi-tetrode parallel recording system (Neuralynx or Axona) was 
used to obtain local field potentials and spike activity from individual cells, to mon- 
itor animal behaviour, and to time odour and food delivery. In rats with implants 
in dCA1 and LEC, implantation and tetrode turning were performed prior to the 
main part of the behavioural training (T1 to T5). 

Analyses were focused on the ‘cue sampling period’ during which the rats stood 
still and actively sampled odour cues at the cue port. Because all animals were trained 
and tested similarly, there was no randomization or blinding. The experiments were 
performed in accordance with the Norwegian Animal Welfare Act and the European 
Convention for the Protection of Vertebrate Animals used for Experimental and 
Other Scientific Purposes. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Subjects. Data were obtained from 17 male Long Evans rats. In 5 of the rats, the 
tetrodes were implanted along almost the entire proximodistal extent of CA1 using 
an oval-bundle ‘hyperdrive’ consisting of 14 independently movable tetrodes”. Five 
of the rats were implanted with two microdrives carrying a single bundle of 4 tet- 
rodes targeted, respectively, to anterodorsal LEC and distal CA1. Four rats were 
implanted with two microdrives targeted, respectively, to dorsal MEC and distal 
CAL. Finally, three rats were implanted with two microdrives aimed, respectively, 
at anterodorsal LEC and proximal CA1. Sample size was determined as the min- 
imal number of animals that would provide statistical power to detect a group dif- 
ference. We used all animals in which tetrodes hit the target regions. We did not use 
randomization to allocate animals to experimental treatments and the experimen- 
ters were not blind to the identity of the experimental groups. 

All animals were 3-5 months old (450-600 g) at the time of implantation. After 
surgery, they were housed individually in Plexiglas cages (45 X 44 X 30cm) ina 
humidity- and temperature-controlled environment. The rats recovered from sur- 
gery for at least one week before odour-place training. During training, the rats 
were put on a food deprivation regimen that kept them at 80-90% of their free- 
feeding body weight. The animals were kept ona 12-h light/12-h dark schedule. All 
testing occurred during the dark phase. The experiments were performed in accor- 
dance with the Norwegian Animal Welfare Act and the European Convention for 
the Protection of Vertebrate Animals used for Experimental and Other Scientific 
Purposes. 

Electrode implantation and surgery. Tetrodes were constructed from four twisted 
17 um polyimide-coated platinum-iridium (90%-10%) wires (California Fine Wire). 
The electrode tips were plated with platinum to reduce electrode impedances to 
between 150 and 300 kQ at 1 kHz. During implantation surgery, the animals were 
anaesthetized with isoflurane. Air flow was kept at 1.21 min” ' with 0.5-3% (v/v) 
isoflurane as determined by physiological monitoring. Local anaesthetic (Xylocaine) 
was applied on the skin before making the incision. Holes were drilled on the dorsal 
skull, after which the rat was implanted with either two microdrives or a hyperd- 
rive. Hyperdrives for CA1 were implanted in the right hemisphere. Microdrives for 
dCA1/pCA1 and LEC/MEC were implanted in the right and left hemispheres, 
respectively. Bihemispheric implants were used because layer III EC cells have both 
ipsi- and contralateral projections to CA1'**°. Hyperdrive tetrodes were implanted 
vertically between AP 3.4 and 4.9 mm, and ML 2.1 and 3.7 mm, relative to bregma 
in order to cover as much of the proximodistal axis of CA1 as possible”. For micro- 
drive implants, one drive was implanted either in the distal or the proximal part of 
CAI (AP 4.2, ML 3.3 and AP 3.2, ML 4.0, respectively, distance from brain surface 
1.5mm, tetrode tips pointing medially at an angle of 30° in the coronal plane). The 
other drive was implanted in either dorsal anterior LEC (AP 6.1, ML 5.0, distance 
from brain surface 4.5 mm, with tetrode tips pointing laterally at an angle of 16° in 
the coronal plane) or in the dorsomedial quarter of the MEC (4.5 mm lateral to 
midline, 0.2-0.3 mm anterior to the transverse sinus, distance from brain surface 
1.5 mm, tetrode tips pointing in the anterior direction at an angle of 10° in the sag- 
ittal plane). Hyperdrives and microdrives were fixed to the skull with jewellers’ screws 
and dental cement. Two screws inserted above the cerebellum were used as ref- 
erence and ground electrodes during recordings. For recording of sniffing beha- 
viour, a temperature sensor (a 0.005-inch, Teflon-coated thermocouple, Omega, 
5TC-TT-K-36-36) was implanted in the right nostril through a hole made in the 
dorsal skull. The implant was secured with dental cement. Thermocouples were 
inserted into the nasal passage through the nasal bone just rostral to the turbinates. 
Tetrode placement. Over the course of ~2-3 weeks, the tetrodes were lowered in 
steps of 200 tum or less until well-separated, large-amplitude, low-frequency neu- 
rons appeared in the CA1 pyramidal cell layer or in LEC layer III at a depth of 
~7.0mm. In MEC, tetrodes were targeted at layer III at a depth of ~2.0-4.0 mm. 
Tetrode positions were maintained across sessions, except in LEC and MEC, where 
the tetrodes were lowered 100 tm to record from a new set of neurons after the 
recordings at time point T5 were completed. All tetrode locations were verified 
histologically at the end (Extended Data Fig. 1). Only animals with tetrodes in the 
regions of interest were used in this study. 

Behavioural task. Cue-place association task. All rats were tested in an odour place 
association task (Supplementary Video 1). The test box (50 X 50 X 40 cm) con- 
tained a cylindrical port for odour delivery (‘cue port’, 2.9 cm internal diameter, 
i.d.), and two circular food cups for food delivery (4.5 cm i.d.). Entry and exit from 
odour ports and food cups were detected via an infrared photo-beam located inside 
the odour port and the food cups. Three pairs of monomolecular odorants (Sigma- 
Aldrich) were used as odour cues. Two odours (A/B) were used during the initial 
stage, of the task (A = isoamyl acetate (banana odour) / B = o-pinene (pine tree odour)). 
For subsequent stages with novel odours, we used t-anethol (anis odour) as C and 
geraniol (rose odour) as D, and undecanoic y-lactone (peach odour) as E and benz- 
aldehyde (almond odour) as F. All odours were diluted in mineral oil (Sigma) to a 
final concentration of 5% (v/v). 


Odour cues were delivered using an olfactometer (PHM-275, Med Associates, 
St. Albans, VT, USA) housed in a shielded box and located next to the behavioural 
box. Computer-controlled solenoid valves triggered flow of individual channels 
of odorized air (1.01 min ') diluted in charcoal-deodorized air (1.01 min '). The 
odorized air was delivered from the olfactometer to the cue port via Teflon tubing 
(1.5mm id. X 70cm). Rats self-initiated each experimental trial by introducing 
their snout into the cue port. After a random delay of 0.15-0.25 s, either of two 
odour cues was delivered. The order of odours was altered pseudorandomly across 
trials. A tone signal was delivered 1.0s after cue initiation. If the rat stayed in the 
cue port until the tone signal, food reward was made available in one of the food 
cups. Odours A, C and E were associated with food in the left cup and odours B, D 
and F with food in the right cup. Withdrawal of the nose from the cue port termi- 
nated the delivery of odour. Rats usually withdrew from the odour port within 1.0- 
1.5s and then moved to the cue-associated food cup. If the rat withdrew its nose 
from the cue port before the tone signal, no food reward was delivered in any of the 
cups. Correct choices were rewarded with a 45 mg sucrose pellet delivered via a food 
dispenser (EVN-203-45IR, Med Associates) as soon as the rat broke the photobeam. 

Training procedure. Animals were first habituated to the behavioural apparatus 
and the experimenter for one week and then pre-trained for another week to per- 
form nose pokes at the cue port in the absence of odour cues. Nose poking was 
followed by running to the food cups and consumption of food rewards. After com- 
pletion of pre-training, odour cues A/B were introduced in a pseudorandom order 
during the cue sampling interval and the rats were trained to associate one odour 
with the left food cup and the other with the right cup by being rewarded only for 
appropriate choices (Fig. 1b). Training was terminated when the rats reached to a 
criterion of 85% correct rate for two consecutive sessions (Fig. 1c, 16.8 + 0.8 days). 
Days 2, 6, 10, 14 and 18 after odour introduction were defined as time-points 
T1-TS, respectively (Fig. 1c). The animals were trained on at least two sessions of 
30 min daily, usually with ~60 trials per session. Surgery was performed after pre- 
training. For 5 dCA1-LEC rats, associative training sessions were performed after 
the tetrodes were in place at the regions of interest. 

Non-cued task. After the animals reached the criterion, they were tested in a 
non-cued control task, in which only flow of vehicle pure air without odour cues 
was delivered when the animal made the nose poke. After the poke, the animals 
made random runs to either of the two foods ports, 85% of which were randomly 
rewarded. Non-cued trials were scheduled after cued sessions each day after the rat 
had reached the criterion. 

Novel cue task. In four rats with simultaneous implants in dCA1 and LECs, train- 
ing was continued with odour cues C/D, using a similar protocol as for A/B. Cri- 
terion was reached within 3 days. The same rats were subsequently trained for cues 
E/F for another 3 days. 

Random foraging task. On each testing day, after the cue-place association task, 

the rats were tested in a random foraging task in an open field (100 X 100 cm) to 
estimate spatial tuning of the cells. The rats were trained for at least one session of 
10-20 min. Running was motivated by chocolate cereal crumbs or vanilla cookie 
crumbs thrown randomly into the enclosure. 
Data collection. Spike activity was recorded against reference electrodes located at 
the level of the corpus callosum or higher up for hyperdrive recordings, and against 
tetrodes with no spike activities in the same region for microdrive recordings. Local 
field potential (LFP) was always recorded single-ended against a ground in the skull 
above the cerebellum. 

The tetrode drive was connected to a multichannel, impedance matching, unity 
gain headstage. The output of the headstage was conducted to an Axona acquisi- 
tion system (Axona Ltd) or toa Neuralynx data acquisition system (Neuralynx) via 
alightweight multiwire tether cable and through a slip-ring commutator. Unit acti- 
vity was amplified by a factor of 3,000-5,000 and band-pass filtered from 800 to 
6,700 Hz (Axona) or from 600 to 6,000 Hz (Neuralynx). Spike waveforms above a 
threshold set by the experimenter (~55 j1V) were time-stamped and digitized at 
32 kHz (Neuralynx) or 48 kHz (Axona) for 1 ms. 

For the hyperdrive recordings in CA1, LFP signals, 1 per tetrode, were recorded 
in the 0-475-Hz frequency band at a sampling rate of ~2,000 Hz (1,893 Hz sam- 
pling and 2,034 Hz sampling for analogue and digital Neuralynx systems, respect- 
ively). For the microdrive recordings in dCA1 and LEC, LFP signals from two 
tetrodes per microdrive were amplified 1,000-3,000 times, lowpass-filtered at 500 Hz, 
and sampled at a rate of 4,800 Hz (16 bits per sample). This allowed us to analyse 
the dynamics of oscillatory coherence between dCA1 and LEC for 2 X 2 = 4 ‘record- 
ing pairs’ in each animal. In a more restricted analysis, we used only one pair from 
each animal — the pair that showed the highest power in the 20-40-Hz frequency 
band (assuming that coherence could be estimated most accurately from this tetr- 
ode). Notch filters were not applied. Signals from thermocouples were recorded as 
described for LFP signals. Light-emitting diodes (LEDs) on the headstage were used 
to track the rat’s movements at a rate of 25 Hz (Neuralynx) or 50 Hz (Axona). 
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Data analysis. Unless indicated otherwise, analyses were performed using MATLAB 
code written by the authors. Methods for spectral and wavelet analysis were de- 
scribed previously’*. Analyses were performed for periods of interest (that is, the 
cue sampling period at the cue port, runs from cue port to food cups, or runs from 
food cups to the cue port) during individual trials and then averaged across mul- 
tiple trials within a session. Unless otherwise indicated, data from the cue sampling 
period were taken from the second half (500-1,000 ms) to reflect the delayed onset 
of changes in LFP and spike activity (Figs le, g, h and 2a, and Extended Data Fig. 9a, e). 
Data were normalized against the pre-cue period, when the animal was stationary 
at the odour port before the cue was delivered (150 to 0 ms before cue onset). To be 
sure that no odour sampling occurred during the baseline period, we included only 
trials where the animal’s snout was outside the odour port for at least 1 s before the 
trial. In time-resolved representations, data were averaged by setting the timing of 
odour cue onset as a trigger. For statistical tests, we used parametric tests since the 
data did not deviate substantially from a normal distribution. t-tests were performed 
when there was similar variance between groups. s.e.m. are provided for all analyses. 
LEP analysis. Time-frequency analysis of power. Time-resolved power across fre- 
quencies was computed using a wavelet transform method. Signals were convolved 
by a family of complex Morlet’s wavelets w(t, f), one for each frequency, as a func- 
tion of time: 


w(t,f) =A exp (—t?/20?) exp (2inft) 


in which the family of wavelets was characterized by a constant ratio f/og which 
was set to 7, and with o¢= '/10;. The coefficient A was set at: 


(or/n)~? 


in order to normalize the wavelets such that their total energy was equal to 1. 

To control for impedance differences between tetrodes, LFP power was normal- 
ized, for each tetrode, to power during the pre-cue period (150 to 0 ms before cue 
onset) when the animal was stationary at the odour port before the cue was deliv- 
ered. Normalized power was averaged across trials that did not include prema- 
turely terminated trials (<1s) during the 1s period preceding the onset of cue 
sampling. 

Coherence. Coherence was computed using a multitaper method*! from the 
Chronux open source MATLAB toolbox (http://chronux.org/). Recordings were 
cut into non-overlapping, 5 s long windows around the cue sampling event. Results 
for multiple trials in a single session were averaged. We typically used window sizes 
of 0.4-1s and 3-5 tapers. To extract periods of high 20-40-Hz coupling, we first 
computed time-varying coherence within the 20-40-Hz band using 100-ms bins. 
Bins were collected when coherence exceeded 2s.d. of the time-averaged coherence. 

Bandpass-filtering. An acausal (zero phase shift), frequency domain, finite im- 
pulse response bandpass filter was applied to the signals. For theta filtering, 5 and 
6 Hz were chosen for stopband and passband, respectively, for low cut-off frequen- 
cies; 10 and 11 Hz were chosen for passband and stopband high cut-off frequen- 
cies. For filtering of 20-40-Hz oscillations, 18 and 20 Hz were chosen for the stopband 
and passband, respectively, for the low cut-off frequencies; 40 and 42 Hz were 
chosen for passband and stopband high cut-off frequencies. 

Selection of theta cycles. Theta cycles were selected by bandpass filtering the 
signal from 6-10 Hz and selecting local minima in the filtered signal (that is, 
O(t — 1) > O(#) and O(t + 1) > O(Z)). Segments of the recording were collected and 
defined as a theta cycle if the time between detected points fell within a range 
criterion that corresponded to the period of an ~8-Hz theta cycle (that is, 125 + 
25 ms). Local minima of detected theta cycles were required to be separated by at 
least 100 ms. 

Detection of oscillation episodes. To extract periods of 20-40-Hz oscillatory 
activity in the LFP, we first computed time-varying power within the frequency 
bands for each recording. Power at each time point was averaged across the 20- 
40 Hz frequency range to obtain time-varying estimates of oscillatory power. Time 
points were collected when the power exceeded 2s.d. of the time-averaged power. 
Time windows, 160 ms in length, were cut around the identified time points. In 
each 160-ms segment, the maxima of 20-40-Hz oscillatory amplitude were deter- 
mined from the 20-40-Hz bandpass filtered versions of the recordings. Duplicated 
20-40-Hz oscillatory periods, a common consequence of extracting overlapping time 
windows, were avoided by discarding identical maxima values within a given 20- 
40-Hz oscillatory subtype and further requiring that maxima ofa given subtype be 
separated by at least 100 ms. Individual 20-40-Hz oscillatory windows were finally 
constructed from the original, non-bandpass filtered recordings as 400 ms long 
windows centred around the 20-40-Hz oscillatory amplitude maxima. 

Relationship of 20-40-Hz oscillations to theta phase. LFP recordings were bandpass- 
filtered in the theta range (6-10 Hz), and theta phases for each time point were esti- 
mated using the Hilbert transform function from the Signal Processing Toolbox in 
MATLAB. Theta phases at the time points associated with 20-40-Hz maxima (deter- 
mined as described above) as well as theta phases for CA1 place cell spikes were 
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collected. Theta phases for each 20-40-Hz oscillatory event were sorted into 30° 
bins, allowing the phase distribution of each event to be determined. For a given 
recording, the distributions of 20-40-Hz oscillations were normalized by dividing 
the bins by the total number of 20-40-Hz oscillatory episodes within a given record- 
ing. The normalized distributions were averaged across recordings. In this analysis, 
and in all analyses involving oscillation phase, the oscillation trough was defined 
as 0°. 

Cross-frequency analysis. The time-varying power in a particular frequency band 
of the LFP (from 2-200 Hz, in 2 Hz wide frequency bins) was calculated using a 
Morlet’s wavelets technique (see above). Coherence was estimated between the orig- 
inal signal and the time-varying power of the signal for each frequency of interest 
using the magnitude squared coherence function (Hanning window with 50% over- 
lap; 16,384 and 4,096 FFT points for Axona and Neuralynx data, respectively) from 
the Signal Processing Toolbox in MATLAB. This provided a measure for deter- 
mining if power changes at particular frequencies (for example, 20-40 Hz) were 
correlated with phase at other frequencies (for example, theta). 

Time-frequency representation of power across individual theta cycles. Time- 
varying power in 2-Hz-wide frequency bands, from 2 to 100 Hz, was obtained for 
individual theta cycles using the wavelet transform method described above. Time 
frequency representations for multiple theta cycles recorded from the same site 
and session within the same animal were then averaged. 

Prewhitening. For illustrations of power shown in Fig. 2e, a prewhitening auto- 
regressive filter (order of 5) was applied before computation of time frequency anal- 
ysis to de-emphasize the lower frequency end of the spectrum for illustration purposes. 
Filter coefficients were estimated using a Yule- Walker approach. Prewhitening was 
not applied in any of the analyses; it was only used to better visualize power at high 
frequencies in the selected example above. 

Speed-frequency correlation. The relationship between running speed and LFP 
frequency was estimated following procedures described previously’. Briefly, we 
first calculated the absolute power spectrum of LFP during each 200 ms time win- 
dow. Relative power was then calculated by determining a z-score for each 0.49 Hz 
frequency bin across successive time windows of individual sessions. Finally, speed 
was correlated with peak frequency for each speed bin. 

Sniffing analysis. Sniffing was measured by thermocouples implanted in the right 
nostril of the animals”. In each breathing cycle, inhalation is associated with decay- 
ing voltage output from the thermocouple, and exhalation with increasing voltage 
output. Signals from the thermocouples were bandpass-filtered at 0.5 and 1.5 Hz 
for stopband and passband, respectively, for low cut-off frequencies, and at 44.5 
and 45.5 Hz for passband and stopband for high cut-off frequencies. Troughs of the 
signals were detected and the duration of individual sniffs was determined. The 
inverse of individual sniffing periods was taken as the instantaneous frequency of 
the sniffing. 

Spike analysis. Spike sorting and cell classification. Spike sorting was performed 
offline using graphical cluster-cutting software described previously». Putative 
excitatory cells were distinguished from putative interneurons by spike width and 
average rate. Firing rate maps (see below) and head direction tuning curves were 
used for classification of cell types in MEC****. Putative excitatory cells with mean 
firing rates of more than 1.0 Hz (mean firing rate during all left and right cue sam- 
pling intervals) were included for further analysis. 

Since the tetrodes were not moved during task acquisition, most of the units 
included in the recordings at time points T1-T5 were from the same cell popu- 
lation. However, occasionally some cells were lost during the course of learning 
whereas other cells were added, causing slight variation in the number of the cells 
from session to session (from T1 to T5: between 63 and 75 cells in dCA1 and between 
76 and 82 cells in LEC). Cells on error trials (T5e) were from the same population as 
recorded on correct T5 trials and on down-sampled T5 trials (T5d). 

Phase-locking of spikes to 20-40-Hz oscillations. Spikes that occurred during 
20-40-Hz oscillatory episodes were selected. For these oscillation-associated spikes, 
oscillatory phase at the time of spike occurrence was estimated. This was performed 
by first bandpass-filtering the LFP, performing a Hilbert transform on the filtered 
signal, and then extracting the phase component at the spike times. Cells were con- 
sidered to be phase-locked if their phase distribution differed significantly from a 
uniform distribution (P < 0.05, Rayleigh test). 

Selectivity index. Odour-specific representation of spikes was assessed using a 
selectivity metric. Selectivity was computed by comparing firing rates during sam- 
pling for left-associated and right-associated cues on successive 100-ms bins: 


Selectivity = (FR; — FR.) /(FRi + FR2) 


where FR, and FR; are mean firing rates across multiple trials in individual 100-ms 
bins during left- and right-associated cue samplings, respectively. 

To confirm the results with this difference-over-sum index, we also computed 
Selectivity using receiver-operating characteristic (ROC) analysis*’, a method based 
on signal detection theory: 
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Selectivitygocg =2 x (auROC—0.5) 


where auROC is the area under the ROC curve computed from spike numbers for 
left and right trials in individual 100-ms bins. auROC = 0.5 indicates that the firing 
rates were not different between left and right trials; auROC = 1 indicates that firing 
rates in all left-cue sampling trials were larger than all right-cue sampling trials (see 
Extended Data Fig. 10). 

After scaling, for both Selectivity metrics, Selectivity = 1 indicates that all firing 
occurs during the left-associated cue odour, whereas Selectivity = —1 means that 
all firing occurs during right-cue odours. 

Population vector analysis. To estimate the development of odour-specific re- 
presentations for a whole population of cells, population vector (PV) analysis was 
performed as described previously**”’. Mean firing rates in individual 100-ms bins 
were pooled for cells from all animals to construct vectors for left and right- 
associated odours. A correlation coefficient (CC) for the two vectors was computed 
and then re-scaled as a PV Differential Index (PDI) to emphasize the difference 
between the population vectors as 


PDI=1—CC 


PDI = 0 indicates that the population vectors were identical, whereas PDI = 1 de- 
notes that activity was maximally different between left and right-associated trials. 

The chance level of the PDI was determined by a random permutation proce- 
dure using all cells recorded at that time-point in each region. One thousand per- 
mutations were performed for each cell in the sample. For each permutation trial, 
individual cue-sampling trials were randomly re-assigned as left or right-associated 
trials, maintaining the number of left and right trials identical to the original data. 
Population vectors were computed from the same ensemble of the cells as the orig- 
inal data for each of 1,000 permutations, resulting in 1,000 sets of PDI. The distri- 
bution of PDI across all 1,000 permutations in the sample was computed and finally 
the 95th percentile was determined as the chance level. 

Firing rate maps. Position estimates were based on tracking of the LEDs on the 
head stage connected to the tetrode drive. For rate maps in the random foraging 
task, data were speed-filtered; only epochs with instantaneous running speeds of 
5cms ' or more were included. 

To characterize firing fields, the position data were sorted into 2.5 cm X 2.5cm 
bins. The path was smoothed with a 21-sample boxcar window filter (400 ms; 10 
samples on each side). Firing rate distributions were then determined by counting 
the number of spikes in each bin as well as the time spent per bin. Maps for number 
of spikes and time were smoothed individually using a boxcar average over the 
surrounding 5 X 5 bins. Weights were distributed as follows: 


0.0025 0.0125 0.0200 0.0125 0.0025 
0.0125 0.0625 0.1000 0.0625 0.0125 
0.0200 0.1000 0.1600 0.1000 0.0200 
0.0125 0.0625 0.1000 0.0625 0.0125 
0.0025 0.0125 0.0200 0.0125 0.0025 


boxcar weights = 


Spatial information. For each cell, the spatial information content in bits per spike 
was calculated as 


: ‘ Ai Ki 

information content = y Pi-log, — 
- A A 
1 


where /; is the mean firing rate of a unit in the ith bin, / is the overall mean firing 
rate, and p; is the probability of the animal being in the ith bin (occupancy in the ith 
bin / total recording time)*°. An adaptive smoothing method, introduced by Skaggs 
et al.*!, was used before the calculation of information scores”. 

Place cells were defined as cells with spatial information content above the chance 
level for this parameter’. The chance level was determined for each time-point 
during learning in each brain region by a random permutation procedure using all 
cells recorded at that time-point in that region. One hundred permutations were 
performed for each cell in the sample. For each permutation trial, the entire sequence 
of spikes fired by the cell was time-shifted along the animal’s path by a random 
interval between 20s and 20s less than the total length of the trial (usually 600 — 
20 = 580s), with the end of the trial wrapped to the beginning to allow for circular 
displacements. This procedure allowed the temporal firing structure to be retained 
in the shuffled data at the same time as the spatial structure was lost. Spatial infor- 
mation was then calculated for each shuffled map. The distribution of spatial infor- 
mation values across all 100 permutations of all cells in the sample was computed 
and finally the 95th percentile was determined. Place cells were defined as cells with 
spatial information scores above the 95th percentile of the distribution from shuf- 
fled data for the relevant group. 

Gridness score. The structure of all rate maps was evaluated for all cells by cal- 
culating the spatial autocorrelation for each smoothed rate map*****’. The degree 


of spatial periodicity (‘gridness score’) was determined for each recorded cell by 
taking a circular sample of the autocorrelogram, centred on the central peak but 
with the central peak excluded, and comparing rotated versions of this sample”. 
Grid cells were defined as cells in which rotational symmetry-based grid scores 
exceeded the 95th percentile of a distribution of grid scores for shuffled recordings 
from the entire population of cells. Shuffling was performed in the same way as for 
place cells by time-shifting the spike sequence for each permutation trial. The shuf- 
fling procedure was repeated 100 times for each of the cells in the sample. Grid 
scores were computed for all permutations for all cells and the 95th percentile was 
read out from the overall distribution. Grid cells were defined as cells with grid 
scores higher than the 95th percentile of the grid scores of the distribution for the 
shuffled data. 

Histology and 3D reconstruction. After receiving an overdose of pentobarbital, 
the rats were perfused intracardially with saline and 4% formaldehyde. The brains 
were then removed and stored in formaldehyde. For verification of tetrode loca- 
tions, the brains were frozen, cut at 30 um thickness, and stained with cresyl violet. 
For hyperdrive recordings, all tetrodes of the 14-tetrode bundle were identified, and 
the tip of each electrode was found by comparison with adjacent sections. For micro- 
drive recordings, the positions of the bundle of four tetrode tips were determined. 
Digital photomicrographs were acquired with a Zeiss Axoimager-Z1 microscope 
equipped with a digital camera using AxioVision software. For CA] recordings, all 
tetrodes were identified in the pyramidal cell layer of the CA1. For LEC and MEC 
recordings, all tetrodes were identified in layer III. In some animals with LEC or 
MEC microdrives, LFP recordings were moved for 100 um to acquire additional 
cells after completion of T5 recordings with the original tetrode locations. In these 
cases, the exact position of the electrodes at the time of the original recording was 
extrapolated”. 

For CAI recordings, the positions of the tetrodes were three-dimensionally recon- 
structed and analysed using as described previously’. Tetrode positions and struc- 
ture of cell layers for each section in the dorsal hippocampal region were digitally 
registered using a computer-assisted system (Neurolucida, MBF Bioscience) accord- 
ing to the manufacturer’s instructions. The tracing output from Neurolucida was 
imported into MATLAB (Mathworks) for further analyses. To place electrode posi- 
tions in identical coordinates and minimize the distortions originating from sec- 
tioning and reconstruction, reconstructed neurons were linearly transformed and 
fitted into reference brain coordinates. A reference brain was first reconstructed. 
Three landmarks for transformation were used: the anterior tip of subiculum (tSUB), 
the anterior tip of the lateral ventricle, and the ‘X’-shaped pyramidal pool of CA3 
in the intermediate portion of the hippocampus”. These landmarks were selected 
because (1) they were distributed at the periphery of the dorsal hippocampus and 
(2) they could be uniquely defined and were unaffected by rotation of the brain. 
The coordinate system was defined as giving the tSUB coordinates of [0, 0, 0] in the 
[x (lateral), y (anterior) and z (dorsal)] directions (in jum). Individual sample brains 
with the tetrode tip positions were then fitted to the reference brain with regard to 
the above landmarks using affine (linear) transformation (translation, rotation, scal- 
ing, and shear). The surface of CA1 pyramidal cell layer was then reconstructed by 
interpolating reconstructed cell layer data. The major (longest) axis of the CA1 cell 
layer was defined as the septotemporal axis of the hippocampus. For each tetrode 
tip, distal-proximal (DP) distance was calculated three-dimensionally by the fol- 
lowing procedure: a transverse plane that includes the tetrode position and is per- 
pendicular to the septotemporal axis is determined. An intersectional curve between 
this transverse plane and the CA1 surface is then calculated. DP distance Dpp was 
defined as Dpp = Lpry/Lpp, where Lpy was the length of the curve between the dis- 
tal ending point of the intersectional curve and the tetrode tip position, and Lpp 
was the length between the distal and proximal two ending points along the curve. 
Dpp = O indicates the distal border, whereas Dpp = 1 indicates the proximal border. 
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Extended Data Figure 1 | Brain sections showing recording positions in 
CA1, LEC and MEC. a, Recording positions (red) in CA1 in coronal brain 
sections from representative animals (top; animal and tetrode numbers 
indicated; distal-proximal (DP) distances as in Extended Data Fig. 5) and in a 
projection of a 3D reconstruction of the CA1 pyramidal cell layer (bottom; 
see Extended Data Fig. 5 for the full 3D reconstruction). Abscissa shows the 
lateral-medial axis and ordinate the anterior—posterior axis. Dorsal—ventral is 
colour-coded. Each dot corresponds to one recording location in distal (red), 
intermediate (green) and proximal (blue) parts of CA1. Stippled line indicates 


septotemporal axis of the hippocampus (see Supplementary Methods). 

b, Tetrode positions (red) in coronal sections through LEC for all 5 animals 
with simultaneous recordings in LEC and dCA1. Animal numbers are 
indicated. ¢, Tetrode positions (red) in sagittal sections through MEC (all 4 
animals with simultaneous MEC and dCA1 recordings). d, Tetrode positions 
(red) in coronal brain sections through LEC from all 3 animals with 
simultaneous recordings in pCA1 and LEC. Orientation: D, dorsal; L, lateral; 
A, anterior. 
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Extended Data Figure 2 | Learning was not associated with changes in 
motor behaviours. a, Left, example trajectories of one animal on sessions T1 to 
T5, on error trials (T5e) and on correct trials downsampled to the same number 
of trials as the error trials (T5d). Red parts of each trajectory indicate positions 
covered during the cue sampling interval, black indicates positions during 
runs from cue port to food cups, and grey shows positions from food cups back 
to the cue port. Right, polar plots showing distributions of head direction 
during the cue sampling interval (T1-T5, T5d and T5e, as indicated to the left). 
Values to the right indicate mean head angle relative to the centre of the cue 
port, designated as north (0°). b, Time course of instantaneous speed before, 
during and after cue sampling, averaged across 5 rats. Shading denotes s.e.m. 
T1is shown at the top; T5, T5d and T5e at the bottom. For every 10-ms time bin 
during T2-T5, speed was compared with the corresponding bin at T1. Speeds 
were not significantly different from T1 at any time bin (q > 0.05; false 
discovery rate (FDR) corrected for multiple comparisons). Speed was not 
different at T5e and T5d. c, Mean head angle (top) and mean vector length for 
head direction (bottom) during the cue sampling interval (mean + s.e.m. for all 
5 animals). Neither head angle nor mean vector length changed significantly 
from T1 to T5 (repeated measures ANOVA: F(4, 16) = 0.44, P = 0.78 for mean 
angle; F(4, 16) = 0.13, P = 0.97 for mean vector length). There was also no 
change in these parameters on the error trials (T5e compared with T5d using 
two-tailed paired t-test: t(4) = 0.70, P = 0.52 for mean angle; t(4) = 0.18, 

P = 0.87 for mean vector length). d, Mean instantaneous speed (top), path 
length (middle) and run duration (bottom) for trajectories from cue port to 
food cups (left column) and from food cups to cue port (right column). None of 
these parameters changed significantly during learning (repeated measures 
ANOVA for time-points T1-T5; for cue port to food cups: F(4, 16) = 0.14, 
P= 0.97 for mean speed, F(4, 16) = 0.55, P = 0.70 for path length, F(4, 16) 

= 0.16, P = 0.95 for run duration; for food cups to cue port: F(4, 16) = 0.35, 
P= 0.84 for mean speed, F(4, 16) = 0.47, P = 0.76 for path length, F(4, 16) 

= 0.13, P = 0.97 for run duration). None of these parameters changed on error 
trials (T5e compared with T5d using two-tailed paired t-test; cue port to food 
cups: t(4) = 0.31, P = 0.77 for mean speed, t(4) = 1.87, P = 0.52 for path 
length, t(4) = 0.91, P = 0.42 for run duration; food cups to cue port: t(4) = 0.14, 
P= 0.89 for mean speed, t(4) = 0.50, P = 0.64 for path length, t(4) = 0.02, 
P= 0.99 for run duration). e, To check if changes in neural activity from T1 to 
T5 are associated with changes in body position, LEDs were attached to the 
back of 4 rats and LED positions were recorded from T1 to T5. Left, example 
trajectories of body position in one animal at different stages of learning. Red 
indicates body positions covered during the cue sampling interval, black 
indicates positions during runs from cue port to food cup, and grey show 
positions from food cup back to the cue port. Right, polar plots showing 
distribution of body direction (deflection from south) during the cue sampling 
interval (T1-T5). f, Cumulative body movement (top), mean body angle 
(middle) and mean vector length for body direction (bottom) during the cue 
sampling interval (mean + s.e.m. of 4 animals). In the middle panel, body angle 
is shown as the absolute change in mean angle compared to T1, since different 
rats may turn in different directions. None of these parameters changed 
significantly from T1 to T5 (repeated measures ANOVA: F(4, 12) = 0.77, 
P= 0.57 for cumulative body movement; F(3, 9) = 1.06, P = 0.41 for mean 
angle; F(4, 12) = 1.11, P = 0.40 for mean vector length). g, Example recording 


showing changes in sniffing amplitude and frequency during cue sampling. 
A temperature sensor (thermocouple) was implanted in the right nostril to 
measure respiration (top). In each breathing cycle, inhalation is associated with 
decaying voltage output from the thermocouple, and exhalation with increasing 
voltage. Instantaneous sniff frequency was determined from the voltage change 
(bottom). Note that sniff frequency increased during cue sampling (between 
t= 0sandt= 1s) and when the animal approached the food cup. The food cup 
was reached at t = 3.8 s in the present example. h, Time course of sniff 
frequency averaged for 5 rats at time-points T1-T5 as well as on error trials 
(T5e) and correct trials down-sampled to the same number of trials (T5d). 
Shading denotes s.e.m. At time points T2-T5, sniff frequency was compared for 
every 10 ms bin with corresponding bins at T1. None of the comparisons 
were significantly different (q > 0.05; FDR corrected for multiple 
comparisons). Sniff frequencies between T5d and T5e were also not different. 
i, Mean sniff frequency during the cue sampling interval. Sniff frequency did 
not change during the course of learning (repeated measures ANOVA: 

F(4, 16) = 0.24, P = 0.91) or on error trials (T5e compared with T5d using two- 
tailed paired t-test, t(4) = 0.03, P = 0.97). j, Left, example traces showing 
position of the animal on non-cued control trials (T5n). Colours indicate 
positions during cue sampling (red), positions from cue port to food cups 
(black) and positions from food cups back to cue port (grey). Right, polar plot 
showing head direction distribution during the cue sampling period for the 
session shown on the left. Values on the right indicate mean head angle. The 
centre of the cue port is north, or 0°. k, Instantaneous speed on non-cued trials, 
averaged across 5 rats. Shading denotes s.e.m. Speed on non-cued trials (T5n) 
was compared at successive 10-ms bins with speed on corresponding bins of 
cued trials (T5). Differences were not significant at any time bin between 2s 
before and 4s after poke onset (q > 0.05, FDR corrected for multiple 
comparisons). 1, Mean head angle (top) and mean vector length of head 
direction (bottom) during the cue-sampling interval. Neither head angle nor 
mean vector length was different from corresponding values on cued trials 
(two-tailed paired t-test, t(4) = 1.85, P = 0.13 for mean angle; t(4) = 1.99, 
P= 0.12 for mean vector length). m, Mean speed (top), path length (middle) 
and run duration (bottom) for trajectories from cue port to food cups (left 
column) and from food cups to cue port (right column). None of these 
parameters were different between cued and non-cued tasks (two-tailed paired 
t-test; cue port to food cup movements: t(4) = 0.16, P = 0.88 for mean speed, 
t(4) = 0.50, P = 0.64 for path length, t(4) = 1.03, P = 0.36 for run duration; 
food cup to cue port movements: t(4) = 0.21, P = 0.84 for mean speed, 

t(4) = 0.30, P = 0.78 for path length, t(4) = 0.44, P = 0.68 for run duration). 
n, Sniff frequency at the cue port on non-cued trials, averaged across 5 rats. 
Frequency on cued trials is shown as a reference. Shading denotes s.e.m. Sniff 
frequency was compared at successive 10-ms bins between T5 and T5n. 
Differences were not significant at any time between 2s before and 4s after 
poke onset (q > 0.05; FDR corrected for multiple comparisons). 0, Mean sniff 
frequency during the cue sampling period was not different between T5 

and T5n (two-tailed paired t-test, t(4) = 0.89, P = 0.43). p, Mean sniff 
frequency during the cue sampling interval in novel odour trials. Sniff 
frequency did not change over the course of trials with novel odours (repeated 
measures ANOVA: F(6, 18) = 0.77, P = 0.60). 
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Extended Data Figure 3 | Power and coherence over a broader spectrum of 
frequencies in dCA1 and LEC at the end of training. a—d, Time-resolved 
power spectra averaged across tetrodes in a, dCA1 (n = 9 tetrodes), b, pCA1 
(n = 14 tetrodes), c, LEC (m = 10 tetrodes) and d, MEC (n = 8 tetrodes), as in 
Fig. le and g, but for a broader band of the frequency spectrum (0-140 Hz). 
Power is shown as percentage change from power during the pre-cue period. 
dCA1 and pCAI data are from the same animals (rats with tetrodes along the 
entire proximodistal CA1 axis), whereas LEC and MEC data are from separate 
animals (5 and 4 rats, respectively). Right panels show mean power for 
frequencies up to 140 Hz during cue sampling (red) and during running from 
cue port to food cups (blue). Peak frequencies are indicated. In CA1, the mean 
power in the 20-40-Hz frequency band increased from (1.07 + 0.04) x 10° 
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before cue sampling to (1.36 + 0.05) X 10 * during cue sampling and then 
reverted to (0.89 + 0.05) X 10 * mV” after cue sampling (repeated 

measures ANOVA: F(2, 78) = 118, P< 0.001). In LEC, the power was 

(1.05 + 0.12) X 10°, (1.62 + 0.16) X 10 * and (0.78 + 0.08) X 10 * mV’, 
respectively (F(2, 18) = 67.1, P< 0.001). e, f, Time-resolved coherence spectra 
averaged across tetrode pairs in LEC and dCAI (e) and in MEC and dCA1 (f), 
as shown in Fig. 1h, but across a broader band of the frequency spectrum (same 
animals as in c and d; data are averaged across all EC-dCA1 tetrode pairs). Note 
that, during running, MEC shows fast gamma oscillations (60-100 Hz) that are 
coherent with dCA1 LFPs (arrows). Right panels show mean coherence spectra 
during cue sampling (red) and during running (blue), with peak frequencies 
indicated. LEC did not show fast gamma oscillations. 
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Extended Data Figure 4 | Effect of running speed on oscillation frequency. 
a, Power spectra of oscillations in dCA1, pCA1, LEC and MEC as a function of 
running speed at T5 during random foraging in the 1 X 1 m square box. Power 
is normalized at each frequency bin and shown as a z-score’*. Data were 
averaged across 9 (dCA1), 14 (pCA1), 10 (LEC) and 8 (MEC) tetrodes from 
5 dCA1 and pCA1, 5 LEC and 4 MEC rats, respectively. b, Peak oscillation 
frequency (mean + s.e.m. for all tetrodes) as a function of speed bin in a. As in 
previous work", a significant correlation was observed between speed and peak 
frequency of oscillations in the 20-140-Hz range (P < 0.001; r(58) = 0.71, 0.44, 
0.60 and 0.59 for dCA1, pCA1, LEC and MEC, respectively). c, d, Speed- 
frequency relationship in the cue-place association task. Same tetrodes as in 
a. Data were plotted during the cue sampling period (c) and during subsequent 
running from the cue sampling port to the food cups (d). Note that during cue 


O43 @ 
speed (cm/s) 


— speed (cm/s) 

sampling, speed was minimal (~0-1 cms’ ‘). Strong 20-40-Hz oscillations 
were observed in dCA1, pCA1 and LEC (arrows in c) but not at similar speeds 
during running (d). e, In all four brain regions (dCA1, pCA1, LEC and MEC), 
the power at very low speeds (0-1 cms ') was significantly stronger during cue 
sampling than during running (paired t-test, P< 0.01; t(8) = 3.9, t(13) = 13.7, 
t(9) = 13.4 and t(7) = 4.5 for dCA1, pCA1, LEC and MEC, respectively), 
suggesting 20-40-Hz oscillations do not reflect low speed as such. f, During 
running from cue port to food cups, a significant positive correlation between 
speed and frequency was observed only in pCA1(P < 0.001, r(58) = 0.48). 

g, Time spent in each speed bin during the running part of the cued task 
(means + s.e.m. for all 14 rats). Note that all speed bins (including low speeds) 
were sampled for 2.5s or more. 
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Extended Data Figure 5 | 20-40-Hz oscillations in dCA1 and pCA1 are 
functionally decoupled. a, Time-resolved power spectra averaged across all 
tetrodes in the distal one-third of CA1 (dCA1, top) as well as the proximal one- 
third (pCA1, bottom) (9 and 14 recording sites, respectively). Data were pooled 
over 5 rats with hyperdrive implants spanning a wide transverse range of CA1. 
To control for impedance differences between tetrodes, LFP power during 
cue sampling was normalized, for each tetrode, to power during the pre-cue 
period when the animal was stationary at the odour port before the cue was 
delivered. Normalized power was averaged across tetrodes. Time t = 0 indicates 
cue onset. b, Positions of all tetrodes from which data were recorded in 

a, plotted in a 3D reconstruction of the CA1 cell layer. Distance from the 
anterior tip of subiculum is shown in micrometres. Position on the distal- 
proximal (DP) axis (0, distal; 1, proximal; see Supplementary Methods) was 
calculated in 3D space for each tetrode. CA1 tetrodes were grouped into distal 
(red point), intermediate (green point) and proximal (blue point) groups, each 
corresponding to one-third of the DP axis. c, Power spectra of LFP averaged 
over tetrodes in distal and proximal one-thirds of CA1 during cue sampling. 
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Power is shown as percentage change from the pre-cue level. During cue 
sampling, power in the lower part of the 20-40-Hz band was stronger in dCA1 
than pCA1 (11-25 Hz, green dots; q < 0.05; false discovery rate (FDR) 
corrected for 60 multiple comparisons at 1-60 Hz using 1 Hz bins, q < 0.05), 
whereas power in the higher part was lower in dCA1 (33-47 Hz, green dots; 
q < 0.05). d, To compare power in the lower and higher parts of the 20-40-Hz 
oscillation across the DP axis in CA1, the ratio of power in the slower part 
(20-25 Hz) and the faster part (33-40 Hz) was plotted as a function of DP 
distance. Each dot refers to one tetrode. Red, distal one-third; green, 
intermediate; blue, proximal one-third. Low/high power ratio was negatively 
correlated with DP distance (7(38) = —0.50, P< 0.001). e, Time-resolved 
coherence spectrum for dCA1 versus pCA1. Data were averaged over 28 
recording pairs in dCA1 and pCA1 from animals with tetrodes in both regions. 
f, Mean coherence between dCA1 and pCA1. Note lack of change in 20-40-Hz 
coherence during cue sampling compared to pre-cue or run periods (repeated 
measures ANOVA, F(2, 54) = 0.75, P = 0.48). 
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Extended Data Figure 6 | Comparison of power and coherence development 
during learning. a, b, Power of 20-40-Hz oscillations in dCA1 (a, n = 10 
tetrodes) and LEC (b, m = 10 tetrodes) from 5 rats at time points T1-T5, on 
error trials at T5 (T5e) and at T5 after down-sampling (T5d), shown as values 
normalized by power at time point T1. Power of 20-40-Hz oscillations did not 
change significantly during the course of learning (repeated measures ANOVA: 
F(4, 36) = 0.72, P = 0.58 for dCA1; F(4, 36) = 1.71, P= 0.17 for LEC). Power 
on error trials (T5e) was not significantly different from correct trials (T5d; 
two-tailed paired t-test, (9) = 1.40, P = 0.20 for dCA1; t(9) = 0.06, P = 0.94 for 
LEC). c, d, Power of theta oscillations in dCA1 (c) and LEC (d). Theta power did 
not change during learning (repeated measures ANOVA: F(4, 36) = 0.90, 
P= 0.47 for dCA1; F(4, 36) = 0.63, P = 0.64 for LEC). Theta power on error 
trials was not different from correct trials (two-tailed paired t-test, t(9) = 0.24, 
P= 0.82 for dCA1; t(9) = 0.40, P = 0.70 for LEC). e, Power of 20-40-Hz 
oscillations in dCA1 (left, n = 8 tetrodes) and LEC (right, n = 8 tetrodes) at the 
end of training with the original odours (A/B; time point T5), and with odours 
C/D and E/F (mean + s.e.m.) (4 rats). Power of the 20-40-Hz oscillations did 


) 
7423 1423 
Day 


not change significantly with the new odours, neither between T5 and the first 
day with C/D or E/F (t(7) < 0.5, P> 0.6 for CA1, t(7) < 0.9, P > 0.4 for LEC) 
nor during the course of learning with C/D and E/F (repeated measures 
ANOVA: F(6, 42) = 0.34, P = 0.91 for dCA1; F(6, 42) = 0.44, P = 0.85 for 
LEC). f, Power of theta oscillations in dCA1 (left) and LEC (right) 

(mean = s.e.m.). There was no significant change in the power of theta 
(repeated measures ANOVA: F(6, 42) = 0.13, P = 0.99 for dCA1; F(6, 42) 

= 0.71, P = 0.64 for LEC). g, Mean of LEC-dCA1 coherence spectra during cue 
sampling at successive time points during learning (T1-T5) and on 

error trials (T5e) and down-sampled correct trials (T5d), plotted in the same 
way as the time-resolved coherence spectra in Fig. 2b (n = 20 recording 

pairs from 5 rats). Dots above spectra denote frequencies with significant 
difference (FDR corrected for 62 multiple comparisons at 0-60 Hz, q < 0.05). 
h, Coherence averaged across the theta-frequency band during cue sampling in 
novel odour trials. No change was observed for coherence in the theta 
frequency band (repeated measures ANOVA: F(2, 30) = 0.02, P = 0.98 for cues 
C/D and F(2, 30) = 0.25, P = 0.78 for cues E/F). 
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Extended Data Figure 7 | Cross-frequency coupling of 20-40-Hz 
oscillations to local theta oscillations in LEC and pCA1. a-d, Relationship of 
20-40-Hz oscillations in LEC to phase of local theta oscillations, plotted as in 
Fig. 2d-g (n = 10 tetrodes from 5 rats). a, Theta phase distribution of 20-40-Hz 
oscillation maxima at T1, T3, T5 and on error trials at T5 (T5e). 0° was defined 
as the trough of the theta cycle. Note that theta oscillations exist in LEC and that 
20-40-Hz oscillations are moderately phase-coupled with theta oscillations 
already at T1. b, Mean vector length calculated from theta phase distribution of 
20-40-Hz oscillation maxima at T1-T5. The degree of cross-frequency 
coupling did not change significantly across the learning period (T1-T5, 
repeated measures ANOVA: F(4, 36) = 1.7, P= 0.17). No difference was 
observed on error trials (T5e, compared with T5d using a two-tailed paired 
t-test, t(9) = 0.78, P = 0.45). c, Representative cross-frequency coherence plot 
showing for LEC that power of 20-40 Hz oscillations (y-axis) is modulated by 
theta phase (x-axis) during cue sampling at T5. Coupling strength is colour- 
coded (dark blue, no coupling; red, maximal coupling). d, Top, time-resolved 
power spectrum averaged across all theta cycles with 20-40-Hz oscillations at 
T5 in LFP from 10 tetrodes in all 5 rats. t= 0 corresponds to the theta trough. 
Bottom, averaged unfiltered theta cycle. 20-40-Hz oscillations occurred at the 
falling phase of the theta wave. e-h, Similar plots for pCA1 (n = 14 tetrodes 


Pre-cue theta phase (deg) 


from 5 rats; e, theta phase distribution; f, mean vector length; g, cross-frequency 
coherence plot; h, time-resolved power spectrum). Animals with implants 
in pCA1 were recorded only after the completion of learning, that is, only 
at T5. In pCA1, oscillations at ~30-50-Hz were phase-coupled with theta 
oscillations. The degree of coupling did not change on error trials (two-tailed 
paired t-test, (13) = 1.29, P = 0.22). i, Same plots as in Fig. 2f, but with wider 
60° bins. The diagram shows the relationship of 20-40-Hz oscillations in dCA1 
to the phase of local theta oscillations (n = 10 tetrodes from 5 rats). The use of 
wider bins did not change the results. Significant cross-correlations between 
cell pairs in LEC and CA1 were not found, as expected due to the sparse 
connectivity between cell pairs in these areas’*. j, Theta phase distribution of 
20-40-Hz oscillation maxima for LFP from dCA1 as in Fig. 2f, but during the 
pre-cue period, at time points T1, T3, T5 (n = 10 tetrodes from 5 rats, 
means + s.e.m.). k, Mean vector length calculated from theta phase 
distributions of 20-40-Hz oscillation maxima during the pre-cue period did 
not change during the course of training (repeated measures ANOVA: 

F(4, 36) = 0.26, P = 0.90; n = 10 tetrodes from 5 rats). Vector lengths during 
cue sampling was increased compared to the pre-cue period at T5 (paired t-test, 
P<0.05; t(9) = 2.5) but not at TI-T4 (P > 0.05; t(9) < 1.9). 
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Extended Data Figure 8 | A large fraction of CAl and MEC cells with 
activity at the cue port were place cells and grid cells, respectively. a, Spatial 
distribution of firing in example cells with cue-port activity recorded in dCA1, 
pCA1, LEC or MEC in either the odour-cue association task (top) or a random 
foraging task in a 1 m square box (bottom). Each column shows results for one 
representative cell. For each cell, spike position (red) is overlaid on the 
trajectory of the rat (grey) at the top and a colour-coded frequency map is 
shown at the bottom. Red is maximum firing rate, as indicated by the scale bar. 
Rat number, tetrode number (f) and cell number (c) are indicated above each 
path diagram. Peak frequencies are indicated at the top right of the colour map. 
Note that dCA1/pCA1 and MEC cells had clearly distinguishable place fields or 
grid fields in the foraging task, whereas the LEC cell showed no clear spatial 


modulation. b, Top, distribution of spatial information scores calculated from 
firing rate distributions in the random foraging task for cells with cue-port 
activity in dCA1, pCA1, LEC and MEC. Results in dCA1 are shown for both 
microdrive (MD) and hyperdrive (HD) implants. Bottom, distribution of 
shuffled data based on 100 permutations per cell. Red lines indicate 95th 
percentile value (chance level) for a distribution based on all permutations in 
each area. 95th percentile value is indicated in red. Percentage of cells that 
exceeded chance level is shown for each region. Note that 74% and 81% of 
the cue-port cells were spatially modulated in dCA1 and pCAI, respectively, 
that is, they were place cells*. ¢, As in b, but for the distribution of gridness 
scores. Note that 89% of cue port cells in MEC had gridness scores that 
exceeded chance level and so were defined as grid cells*’. 
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Extended Data Figure 9 | Phase-locking of individual neurons to the 
20-40-Hz rhythm. a, Raster plots showing cue-port activity of example dCA1 
cell on successive trials at T5. Rows correspond to individual trials; ticks 
indicate spikes. Top, trials with food in left cup; middle, trials with food in right 
cup; bottom, peri-stimulus time histogram (PSTH); orange, left-predicting 
trials; black, right-predicting trials. Between T1 and T5, the number of dCA1 
cells with mean rates above 1 Hz during the cue interval ranged from 63 to 75 
(5 rats). b, Spike-time distribution for dCA1 principal cells with cue-port 
activity across phase of local 20-40-Hz oscillation at T1-T5 and T5e. 

c, d, Percentage of significantly phase-locked cells (c) and mean vector length of 
distribution in b (d), averaged across dCA1 cells with cue-port activity. T5d as 
in Fig. 2c. ** as in Fig. 2c. e-h, LEC cells, as in a-d. Between T1 and T5, 

the number of LEC cells with mean rates above 1 Hz during the cue interval 
ranged from 76 to 82 (5 rats). The number of cue-port cells phase-locked to the 
local 20-40-Hz LFP increased significantly from T1 to T5 in both dCA1 

and LEC (main text) and there was a significant increase in the phase locking 
of each cell (ANOVA for mean vector length: dCA1: F(4, 348) = 2.81, 

P= 0.026; LEC: F(4, 383) = 21.0, P< 0.001). On error trials, the mean vector 
for the spike-phase distribution decreased in both dCA1 and LEC (dCA1: 
two-tailed paired t-test, t(72) = 2.97, P = 0.004; LEC: t(81) = 4.05, P< 0.001). 
i, j, Interregional phase-locking of individual dCA1 and LEC cells. i, Phase- 
locking of dCA1 spikes as shown in c and d, but against 20?40-Hz oscillations in 
LEC. Left, percentage of significantly phase-locked cells. Right, mean vector 
length of the spike-phase distribution. The percentage of cells that was 
significantly phase-locked to the oscillations increased from 7.9 at T1 to 17.4 at 
T5 (P< 0.005; binomial test with Bonferroni correction). This increase was 
matched by an increase in the mean vector length of the spike-phase 
distribution (ANOVA for mean vector length: F(4, 348) = 2.8, P = 0.03). 

T5e and T5d indicate T5 error trials and down-sampled correct T5 trials, 
respectively. Mean vector length on T5e was significantly reduced compared to 
T5d (two-tailed paired t-test, t(72) = 2.60, P = 0.011). j, Phase-locking of 
LEC spikes as shown in g and h, but against 20-40-Hz oscillations in dCA1. 
The percentage of LEC cells significantly phase-locked to 20-40-Hz oscillations 
in dCA1 increased from 13.3 at T1 to 24.4 at T5 (P<0.005). This was 
accompanied by a significant increase in the mean vector length of the 
spike-phase distribution (F(4, 383) = 2.7, P = 0.03). Mean vector length on 
error trials (T5e) decreased significantly compared to down-sampled correct 
trials (T5d; two-tailed paired t-test, (81) = 2.42, P = 0.017). *P <0.05, 
Bonferroni post-hoc test; *P<0.05, paired t-test. 
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Extended Data Figure 10 | ROC-based and firing rate-based analysis of development of odour-specific representations, we subsequently computed 
odour-specific representations in dCA1 and LEC. a, Schematic Selectivity using ROC analysis, a method based on signal detection theory: 


representation of odour-type spike representations based on receiver operating Selectivity (ROC) was computed as 2 X (auROC — 0.5), where auROC is the 
characteristic (ROC) analysis (left) and direct comparison of firing rates (right). | area under the ROC curve computed from spike numbers for left and right 


Odour-specific representation was assessed using the metric Selectivity. trials on individual 100-ms bins. After scaling, for both Selectivity metrics, 
Selectivity was computed by comparing firing rates during sampling periods _ Selectivity = 1 indicates that the cell fired only on left-associated odour trials, 
for left-associated and right-associated cues in successive 100-ms bins: whereas Selectivity = -1 denotes firing only on right-associated odour trials. 


Selectivity was first expressed as (FR1 — FR2)/(FR1 + FR2), where FR1 and b, Trial-type representations for all dCA1 cells with activity at the cue port 
FR2 are mean firing rates for multiple trials on individual 100-ms bins during _ shownas in Fig. 4a but using ROC analysis. Note development of selectivity also 
left- and right-associated cue samplings, respectively. To confirm the with this method. c, As in b, but for LEC cells. 
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Trace-gas metabolic versatility of the facultative 
methanotroph Methylocella silvestris 


Andrew T. Crombie! & J. Colin Murrell! 


The climate-active gas methane is generated both by biological pro- 
cesses and by thermogenic decomposition of fossil organic material, 
which forms methane and short-chain alkanes, principally ethane, 
propane and butane’”. In addition to natural sources, environments 
are exposed to anthropogenic inputs of all these gases from oil and 
gas extraction and distribution. The gases provide carbon and/or 
energy for a diverse range of microorganisms that can metabolize 
them in both anoxic’ and oxic zones. Aerobic methanotrophs, which 
can assimilate methane, have been considered to be entirely distinct 
from utilizers of short-chain alkanes, and studies of environments 
exposed to mixtures of methane and multi-carbon alkanes have as- 
sumed that disparate groups of microorganisms are responsible for 
the metabolism of these gases. Here we describe the mechanism by 
which a single bacterial strain, Methylocella silvestris, can use meth- 
ane or propane asa carbon and energy source, documenting a metha- 
notroph that can utilize a short-chain alkane as an alternative to 
methane. Furthermore, during growth on a mixture of these gases, 
efficient consumption of both gases occurred at the same time. Two 
soluble di-iron centre monooxygenase (SDIMO) gene clusters were 
identified and were found to be differentially expressed during bac- 
terial growth on these gases, although both were required for efficient 
propane utilization. This report of a methanotroph expressing an 
additional SDIMO that seems to be uniquely involved in short-chain 
alkane metabolism suggests that such metabolic flexibility may be 
important in many environments where methane and short-chain 
alkanes co-occur. 

Most of the 500-600 Tg methane emitted into the atmosphere each 
year is of recent biological origin, whereas fossil-derived ‘natural gas’, 
from both anthropogenic and natural sources, contributes approximately 
30% of the total*. Large anthropogenic releases of natural gas have 
included the Deepwater Horizon spill of 2010 (ref. 5), but intentional 
or unintentional operational releases are also widespread and likely to 
increase with the exploitation of unconventional resources, including 
shale gas extraction (fracking)°. A spectacular natural release, contain- 
ing methane and 35% (v/v) ethane and propane’, results in the ‘eternal 
flame’ in Chestnut Ridge Park, New York. Widespread seepage, which 
is largely undocumented, also occurs in terrestrial areas that overlay sed- 
imentary organic carbon®. The natural geological methane source, includ- 
ing terrestrial macro-seeps and micro-seeps, marine seeps and volcanic 
and geothermal emissions, is the second largest natural source after wet- 
lands. Micro-seeps, which are potentially active in substantial areas (in- 
cluding petroliferous regions and sedimentary basins where thermal 
degradation of organic material has occurred), contribute 10-25 Tg 
methane annually*. This thermogenic natural gas contains methane 
and up to 50% (w/w) ethane, propane and butane’. In addition, un- 
quantified biogenic production of ethane and propane occurs in an- 
oxic environments similar to those that support methanogenesis and 
homoacetogenesis’. 

Aerobic methanotrophs oxidize much of the methane that is released 
from terrestrial and marine environments before it reaches the atmo- 
sphere and are well characterized’, and microorganisms that grow on 
short-chain alkanes, including propane, have also received attention’. 


However, so far, all evidence has emphasized the striking distinction 
between the groups of microorganisms performing these respective 
tasks. Aerobic methanotrophs are Gram-negative bacteria from about 
16 genera within the Gammaproteobacteria and Alphaproteobacteria 
classes, as well as some extremophiles of the phylum Verrucomicro- 
bia. Two types of enzyme that can oxidize methane are known: the 
membrane-associated particulate methane monooxygenase (pMMO), 
which was previously thought to be present in all methanotrophs; anda 
cytoplasmic, soluble form of this enzyme (sMMO), which is also present 
in a few extant strains. The sMMOs form one group ofa large family of 
di-iron carboxylate enzymes that includes SDIMOs that allow growth 
on a range of short-chain alkanes, alkenes and aromatic compounds. 
Recently, a few methanotrophs were shown to have restricted growth 
on two-carbon compounds, including acetate’®; however, there are no 
validated reports of growth on (as opposed to co-oxidation of) gaseous 
hydrocarbons other than methane. Short-chain alkane utilizers have 
been thought to be incapable of methane metabolism, and most are 
members of the class Actinobacteria’. An exception is Thauera buta- 
nivorans, which is a betaproteobacterium that grows on butane using a 
monooxygenase that is two to three orders of magnitude more specific 
for butane than for methane”, despite its sequence similarity (up to 65%) 
to bona fide methane monooxygenases. 

Methanotrophic strains of the genus Methylocella, first isolated from 
peat, tundra and forest soils in Northern Europe’’, are widespread in a 
diverse range of terrestrial environments’? and have also been found, 
for example, among the microbial community following the Deepwater 
Horizon incident”. The ability of M. silvestris to grow on organic acids 
and alcohols is known, as are details of its sMMO'*". Here we further 
investigated the metabolic potential of this organism (Extended Data 
Table 1). M. silvestris grew on propane (2.5% or 20% (v/v)) at a similar 
rate to its growth solely on methane in batch culture (Table 1), reaching 
a high cell density in a fermenter supplied with 20% (v/v) propane in air 
(Extended Data Fig. 1). It also grew on potential intermediates of pro- 
pane oxidation: propionate, 2-propanol and acetone. Its growth on meth- 
ane was not inhibited by the presence of 1% or 10% (v/v) propane in the 
methane, and indeed the conversion of substrate carbon into biomass 
was slightly enhanced under both of these conditions of dual substrate 


Table 1 | Growth rates of Methylocella silvestris strains 
Growth rate (h~+) 


Strain Substrate On 2.5% (v/v) On 20% (v/v) 
Wild-type Methane 0.012 + 0.001 0.013 + 0.003 
Propane 0.005 + 0.000 0.015 + 0.002 
Mixture 0.011 + 0.001 D 
AMmoxX Methane No growth No growth 
Propane No growth 0.010 + 0.000 
Mixture No growth D 
APrmA Methane 0.010 + 0.003 0.019 + 0.005 
Propane 0.003 + 0.000 No growth 
Mixture 0.010 + 0.000 D 
Specific growth rates of wild-type M. silvestris and the AMmoxX and APrmA strains during growth on 
methane or propane (at the concentrations shown) or a mixture (2.5% (v/v) each). The data are the 
mean + s.d. of triplicate vials (except for 2.5% (v/v) propane, for which duplicate vials were used). 


ND, not determined. 
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Figure 1 | Propane enhanced Methylocella silvestris growth, and methane 
and propane were consumed by cultures at rates corresponding to their 
relative concentrations. a, The specific growth rate (y), maximum culture 
density (optical density at 540 nm (ODs,49)) and carbon conversion efficiency 
(CCE) of cultures supplied with equimolar amounts of carbon as methane 
(10% (v/v), white) or mixtures of methane and propane (99:1 (v/v), grey; 90:10 


availability (Fig. 1a). To identify any substrate preference, liquid cultures 
of M. silvestris supplied with various mixtures of methane and propane 
were monitored. The components of the gas mixtures, ranging from 
1:10 to 100:1 (methane:propane), were differentially consumed in direct 
proportion to their relative headspace concentrations (Fig. 1b), with 
methane being consumed at a volumetric rate approximately 1.5 times 
that of propane when supplied at equivalent concentrations. The sim- 
ilar Henry’s law coefficients” of methane and propane (Ky = 1.3 X 10°* 
and 1.4 X 10° * M atm‘, respectively) and consequent aqueous con- 
centrations indicated that the overall kinetic parameters in these micro- 
cosms were similar and that under these conditions M. silvestris did not 
exhibit a major substrate preference for either of these gases. 

Analysis of the genome of M. silvestris’? confirmed the absence of 
the membrane-associated pMMO but revealed an SDIMO gene cluster 
in addition to that encoding sMMO. Phylogenetic analysis of the puta- 
tive hydroxylase «-subunit grouped this second SDIMO with those of 
propane-oxidizing bacteria, including Gordonia sp. TY-5 (ref. 14). These 
SDIMOs belong to Group V SDIMO enzymes as defined by Coleman 
and co-workers” (Extended Data Fig. 2a), suggesting that in M. silvestris 
this gene cluster might encode a propane monooxygenase (PrMO). 

Transcription of this SDIMO was confirmed by reverse-transcription 
PCR (RT-PCR); 5’ rapid amplification of cDNA ends (RACE) was used 
to identify the transcription start site, which is 120 nucleotides upstream 
(5') of the predicted start codon, and a putative o™* promoter was located 
(Extended Data Fig. 2b). A reporter gene (green fluorescent protein 
(GFP)) under the control of each of the putative promoters of these two 
enzyme systems (sMMO and PrMO) was then expressed in M. silvestris 
from plasmids: the sMMO genes, but not the PrMO genes, were tran- 
scribed during growth on methane, but both sets of genes were tran- 
scribed during growth on propane (Fig. 2). The protein profiles differed 
between these two growth conditions, and mass spectrometric analysis 
of bands excised from a gel confirmed that protein expression mirrored 
gene transcription (Extended Data Fig. 3a, b). 

The sMMO from other methanotrophs can co-oxidize numerous sub- 
strates, including propane”, although these organisms cannot assimilate 
the oxidation products, whereas the specificity of PrMO is relatively 
unknown. Therefore, although methanotrophs have been notoriously 
difficult to genetically manipulate, two deletion mutants were constructed 
by targeted marker-exchange mutagenesis, to unravel the relative con- 
tribution of each enzyme to growth on alkanes. Wild-type M. silvestris 
and M. silvestris strain AMmoX and strain APrmA (with deletions of the 
hydroxylase «-subunits of sMMO and PrMO, respectively) were grown 
in batch culture on 20% (v/v) methane or 20% (v/v) propane. At this sub- 
strate concentration, the deletion strains did not grow on the gas for 
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(v/v), black). The data are normalized to the mean (1 = 0.018 hl, 

ODs40 = 0.36 and CCE = 0.28 mg protein per mg substrate carbon); n = 3 vials 
per growth condition. Error bars, s.d. Significance was determined by single 
factor analysis of variance (ANOVA) (compared with growth on 100% 
methane): *, P< 0.05; **, P< 0.01). b, Relative rates of methane and propane 
consumption by cultures supplied with a range of gas mixtures. 


which they lacked the nominal oxidizing enzyme (Table 1). When 
cultures were supplied with 2.5% (v/v) methane or 2.5% (v/v) propane, 
the wild-type strain grew on both substrates. By contrast, the AMmoX 
strain did not grow on 2.5% (v/v) methane, 2.5% (v/v) propane or a mix- 
ture of these (2.5% (v/v) of each). The APrmA strain grew on 2.5% (w/v) 
methane similarly to the wild-type and, interestingly, also on 2.5% (v/v) 
propane but at a reduced rate (Table 1) and with a 30% reduction in 
the conversion of the substrate carbon into biomass compared with 
the wild-type strain (Extended Data Fig. 4a). During growth on a mix- 
ture of methane and propane (2.5% (v/v) each), only a marginal dif- 
ference in growth rate was observed between the wild-type and APrmA 
strains, and the APrmA strain consumed both gases simultaneously, 
similarly to the wild-type strain, although a small decrease in the car- 
bon conversion efficiency was evident (Fig. 3). We also noted that the 
carbon source (succinate or methane plus propane) for the cells used as 
the inoculum did not affect a culture’s subsequent simultaneous oxida- 
tion of methane and propane (Extended Data Fig. 4b). 
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Figure 2 | Methane induced transcription of sMMO only, whereas propane 
induced both sMMO and PrMO. Fluorescence microscopy images of 

M. silvestris cells grown to late exponential phase on methane alone (a, b) or 
propane alone (c, d) transformed with plasmids containing the sMMO 
promoter (a, c) or the PrMO promoter (b, d) fused to a GFP (green) reporter 
gene. The images are representative of one of two independent experiments. 
Cells are approximately 1.5 jum in length. 
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Because sMMO can oxidize both methane and propane, the converse 
ability of PrMO, to oxidize methane, was evaluated by comparing meth- 
ane consumption by the wild-type and AMmoxX strains during growth 
on 20% (v/v) propane in the presence of 2% (v/v) methane. The wild- 
type strain consumed both gases in proportion to their relative concen- 
trations, as expected, whereas there was no detectable consumption of 
methane by the AMmoxX strain (Extended Data Fig. 4c, d). 

The relative inability of PrMO to oxidize methane was used to inter- 
pret the comparative substrate oxidation rates by whole cells of wild- 
type M. silvestris, as obtained by polarographic measurement of oxygen 
uptake in response to the addition of methane or propane, in an oxygen 
electrode. The maximum methane oxidation rates of cells grown on meth- 
ane (expressing only sMMO) were 2.3-fold higher than those of cells 
grown on propane. By contrast, the maximum propane oxidation rates 
were 2.4-fold higher in propane-grown cells (expressing both sMMO and 
PrMO) than in methane-grown cells, indicating that in propane-grown 
cells, the propane-oxidizing ability was due mainly to PrMO (Extended 
Data Fig. 5a). Using this technique, methane-grown cells had an apparent 
Michaelis constant (K,,) of 53 uM for methane, and propane-grown 
cells had an apparent K,,, of 19 uM for propane (Extended Data Fig. 5b, c). 
The relatively high affinity of PrMO for propane emphasizes its impor- 
tant role in propane oxidation and is consistent with the reduced growth 
rate of the APrmA strain on this substrate. Therefore, M. silvestris con- 
tains two enzymes capable of propane oxidation, sMMO and PrMO, and 
deletion of PrMO is not sufficient to completely eliminate the oxida- 
tion of propane. 

Bacterially mediated propane oxidation typically occurs at the terminal 
or sub-terminal carbon, producing 1-propanol or 2-propanol, respec- 
tively, or a mixture’’. Terminal oxidation proceeds from 1-propanol to 
propanal and propionate, whereas 2-propanol is oxidized to acetone. 
Oxygen electrode assays demonstrated that, in comparison with cells 
grown on methane or succinate, propane-grown cells consumed three- 
fold or fivefold more oxygen, respectively, in response to addition of 1- 
propanol and propanal (Extended Data Fig. 6a), and growth tests indicated 
that 1-propanol-metabolizing ability was induced in propane-grown cells 
(Extended Data Fig. 7a—d). Quantitative label-free mass-spectrometric 
proteomic analyses showed that polypeptides of the methylmalonyl- 
CoA pathway of propionate oxidation were induced during growth on 
propane”, indicating terminal oxidation. However, 2-propanol was detected 
in culture media, reaching a maximum of approximately 0.5 mM at 
200 h during growth on 4% (v/v) propane, demonstrating that at least 
25% of propane was oxidized at the sub-terminal position (Extended 
Data Fig. 8). When propane was depleted, 2-propanol was consumed 
by the cultures, and enzyme activities associated with the oxidation of 
sub-terminal intermediates were upregulated in propane-grown cells (Ex- 
tended Data Fig. 6b, c). Therefore, in M. silvestris, propane was oxidized 
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, 3.0 Figure 3 | Inactivation of PrMO had a marginal 
effect during growth on a mixture of methane 
and propane. Growth (left y axis, black lines) and 
substrate consumption (right y axis, coloured lines) 
of wild-type M. silvestris (solid lines, filled symbols) 
and the APrmA strain (dashed lines, open 
symbols). The inoculum was grown on succinate. 
The experimental cultures were supplied with a 
mixture of methane and propane (2.5% (w/v) each). 
The headspace concentrations of methane (red) 
and propane (purple) were determined by gas 
chromatography. n = 3 vials per strain. Error bars, 
s.d. The inset shows the growth parameters for the 
wild-type strain (dark grey) and the APrmA strain 
(light grey). The inset data are normalized to the 
mean (j= 0.010h ', ODs49 = 0.43 and 

CCE = 0.30 mg protein per mg substrate carbon); 
n = 3 vials per strain. Error bars, s.d. Significance 
was determined by Student’s t-test: **, P< 0.01. 
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to both 1-propanol and 2-propanol. However, whereas 2-propanol (reach- 
ing 1.3 mM at 1,050 h) was also detected in the medium of the APrmA 
strain (grown on 4% (v/v) propane) (Extended Data Fig. 9a), it was not 
detected in the medium of the AMmoxX strain (grown on 20% (v/v) 
propane). The sMMO of obligate methanotrophs has previously been 
shown to oxidize n-alkanes to a mixture of primary and secondary 
alcohols’, and our data suggest that in M. silvestris, 2-propanol is a 
product of sMMO but not a major product of PrMO. 

We therefore examined the protein expression pattern of the wild- 
type strain during growth on 2-propanol (Extended Data Fig. 3), and 
we observed that, under these conditions, PrMO, but not sMMO, was 
expressed. Therefore, these data suggest that propane oxidation by sMMO 
is sufficient for the induction of PrMO. The inability of the AMmoX 
strain to grow at low propane concentrations probably resulted from a 
lack of sub-terminal metabolites, which may be responsible for the induc- 
tion of PrMO expression. Interestingly, deletion of prmA also halved the 
growth rate of cultures on 2-propanol but had no significant effect on 
growth on methanol, ethanol, acetate or acetone (Extended Data Fig. 9b). 
A similar unexplained phenomenon has been noted by others'*” after 
the genetic inactivation of propane-oxidizing enzymes in Gordonia 
and Mycobacterium spp. The decreased oxidizing ability, and consequent 
increased accumulation, of 2-propanol in the APrmA strain would also 
have contributed to its growth rate and carbon conversion efficiency being 
lower than those of the wild-type during growth on low concentrations 
of propane. Similarly, during incubation of this strain with higher pro- 
pane concentrations, the 2-propanol that formed as a result of propane 
oxidation by sMMO may have accumulated to toxic levels, thereby 
inhibiting growth. This is consistent with PrMO having a role in the 
oxidation of 2-propanol, either directly or through a regulatory mecha- 
nism. To identify the ability of PrMO to directly participate in 2-propanol 
oxidation, and because acetylene is a potent irreversible inhibitor of 
SDIMOs”*, we grew the wild-type strain on propane, 2-propanol or ace- 
tone in the presence or absence of 2% (v/v) acetylene. Growth on pro- 
pane was entirely prevented by the presence of acetylene, whereas growth 
on acetone was unaffected. Although this inhibition of PrMO reduced 
the specific growth rate of cultures supplied with 2-propanol by approx- 
imately 11% compared with uninhibited controls (Extended Data Fig. 9c), 
this reduction was far less than that caused by the gene deletion in the 
APrmA strain, suggesting that the oxidation of 2-propanol by PrMO 
was relatively minor. Therefore, the exact mechanism by which PrMO 
is required for efficient 2-propanol metabolism remains to be elucidated. 

To better understand the significance and role of facultative metha- 
notrophs in the environment, we investigated the behaviour of whole 
cells in culture, rather than taking the more reductionist approach of 
analysing the specificities and kinetics of purified enzymes in vitro. The 
data indicated that sMMO was necessary and sufficient for growth on 
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methane but that both SDIMOs were required for efficient metabolism 
of propane. Although sMMO was capable of propane oxidation, it did 
not exclusively perform this role, and cells lacking PrMO were at a dis- 
advantage that increased with increasing substrate concentration. This 
report is to our knowledge the first to describe an organism that can 
grow on methane or a short-chain alkane, disproving the dogma that 
separate microorganisms are responsible for the metabolism of meth- 
ane and other simple hydrocarbon gases, such as propane. The pmoA 
gene, which encodes the o-subunit of pMMO, has frequently been used 
as an environmental probe for methanotrophs”, raising the possibility 
that sMMO-only methanotrophs have been overlooked, and some of 
these might have similar capabilities to those described here. Given the 
widespread co-occurrence of methane and short-chain alkanes, includ- 
ing propane, in the environment, the potential for simultaneous oxidation 
of these gases may be considerable. This finding also has implications for 
studies of carbon cycling in environments where natural gas is released, 
such as the 8 million square kilometres of Earth’s terrestrial regions, where 
micro-seeps may be abundant’, or near marine methane seeps, oil spills 
and gas extraction and distribution systems. 


METHODS SUMMARY 


Methylocella silvestris BL2 was grown in 20 ml or 25 ml volumes in 120-ml serum 
vials sealed with rubber stoppers as previously described” and supplied with meth- 
ane or propane by injection through the septum into the headspace. Headspace gas 
concentrations were quantified by gas chromatography, as were metabolites after 
solvent extraction from culture medium. The carbon conversion efficiency was calcu- 
lated as the ratio of culture protein production to substrate carbon depletion. DNA 
and RNA procedures followed standard methods, and transcriptional start sites 
were determined using RACE. GFP-expressing reporter strains were constructed 
as previously described”. For gel-based protein analysis, bands of interest were excised, 
and polypeptides were identified by mass spectrometry. Mutant strains were con- 
structed as previously described”. Polarographic activity assays used a Clark-type 
oxygen electrode to track substrate-induced oxygen consumption. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Cultivation of Methylocella silvestris. Methylocella silvestris BL2 was grown in 
dilute nitrate mineral salts (DNMS) medium (20 ml or 25 ml) in 120-ml serum vials 
sealed with rubber stoppers as previously described*’ and supplied with methane or 
propane by injection through the septum into the headspace. Methane and propane 
were a minimum of 99.5% purity grade. Large-scale cultivation was carried out in 
41 fermenters (LH Series 210, LH Engineering), or 2 1 fermenters (FerMac 300, 
Electrolab) supplied with methane or propane (50-200 ml min’) and air. Other 
substrates were used at a concentration of 5 mM (succinate and acetate) or 0.05% 
(v/v) (methanol, ethanol, 2-propanol and acetone) except where indicated. To 
calculate growth rates, the natural logarithm of culture density (as measured by 
optical density at 540 nm (ODs49)) was plotted against time, and a straight line was 
fitted to the exponential phase of growth using Microsoft Excel. The growth rate 
was determined for each culture from a minimum of three data points or two data 
points separated by at least 48 h. The increase in biomass was defined as the dif- 
ference between the initial and maximum densities (OD549) reached by the culture. 
The consumption of methane or propane was calculated from the depletion of 
headspace substrate between consecutive time points during the growth phase and 
was plotted against the mean of the concentrations at these points. The carbon con- 
version efficiency was evaluated as the ratio of protein production to substrate car- 
bon depletion. 

Measurement of substrate depletion and product formation. The headspace 
gas concentrations were quantified using an Agilent 6890A or 7820A gas chromato- 
graph equipped with a Porapak Q column (Supelco) coupled to a flame ionization 
detector (FID), with injector, column and detector temperatures of 150 °C, 125°C 
and 200 °C, respectively. Standard curves for methane and propane were prepared 
using standards containing known concentrations of the pure gases in air. The 
products of propane oxidation in the culture medium were quantified by solvent 
extraction. Aliquots (1.5 ml) of the culture were centrifuged (16,600 g, 5 min, room 
temperature (21 °C) to pellet the cells. The supernatant (800 il) was added to 400 ll 
ethyl acetate (containing 500 parts per million volume (p.p.m.v.) 1-butanol as an 
internal standard) and approximately 0.4 g NaCl, (sufficient to saturate the aque- 
ous phase), in 1.5 ml tubes. The samples were mixed for 30 min on a tube rotator 
and briefly centrifuged to separate the phases. An autosampler was used to inject 
5 pl organic phase (100:1 split) into an Agilent 7890A gas chromatograph equipped 
with an HP-INNOWax column (30 m X 0.32 mm X 0.25 pum) and FID. The injector 
temperature was 250 °C; the detector was at 250 °C; and the oven was programmed 
to hold at 45 °C for 5 min, then to increase from 45 °C to 250°C at 10°C min |, 
and then to hold at 250 °C for 5 min. The retention time for acetone was 2.40 min; 
1-propanol, 3.34 min, 2-propanol, 5.40 min; 1-butanol, 7.45 min; and acetol, 9.70 min. 
Compounds of interest were quantified using aqueous standards extracted using 
the same method. 

Measurement of cellular protein. Total cellular protein was measured following a 
previously reported method”! with minor modifications. Aliquots of cultures (500 il) 
were stored frozen for analysis. Following thawing, 87.5 il 20% (w/v) SDS and 
87.5 pl 1.6 M NaOH were added, and the tubes were heated in a boiling water bath 
for 5 min. The tubes were cooled, and the NaOH was neutralized by the addition of 
87.5 ul 1.6M HCL, centrifuged (13,000 g, 5 min, 21°C), and 650 ul supernatant 
was removed to fresh tubes. The protein was assayed using a Pierce BCA kit (Fisher 
Scientific), following the manufacturer’s recommendations and using standards 
(500 pl) containing 0-120 pg bovine serum albumin ml’, treated in an identical 
manner to the samples. 

Phylogenetic analysis of MmoX and PrmA protein sequences. BLAST searches 
of the M. silvestris genome” identified two putative SDIMO-encoding gene clus- 
ters: Msil_1262-Msil_1270 and Msil_1651-Msil_1646. The amino acid sequences 
of the putative o-subunits, encoded by Msil1262 and Msil1651, designated mmoX 
and prmA here, were combined with selected MmoX sequences from methano- 
trophs and related SDIMO sequences from non-methanotrophs and aligned using 
the Clustal program in MEGA 5 (ref. 32). Positions containing gaps or missing 
data were eliminated, and 356 amino acid positions in the final data set were used 
to construct a tree using the maximum likelihood method. Bootstrap values were 
based on 500 replications. 

DNA and RNA methods. DNA was extracted from M. silvestris and manipulated 
using standard methods”. To visualize transcription from the mmoX and prmA 
promoters, the vector pMHA203 (ref. 21), which contains the mmoX promoter fused 
to the GFP gene, was modified by replacement of the mmoX promoter sequence 
with the prmA promoter sequence. The primers PrPf (5'-ACTCAATTGTCCGT 
TCCGTAACGCCTCTC-3’) and PrPr (5’-CGGCCGGCTGAGCTCCCGCTAC 
GC-3’) were used to amplify the 1,112 base pair (bp) promoter-containing region 
extending upstream from 44 bp 5’ of the predicted start codon of prmA. (The MunI 
and SacI sites are underlined.) The PCR product was cloned into pCR2.1 TOPO 
(Invitrogen) and excised by digestion with Mun] and Sacl. Following EcoRI/Sacl 
excision of the mmoX promoter region from pMHA203, the MunI/Sacl fragment 


was ligated into these sites, resulting in pAC304. pMHA203 and pAC304 were 
introduced into M. silvestris by electroporation”’, and transformed cells were selected 
on kanamycin-containing DNMS plates. Cells were visualized using an Axioscop 
fluorescence microscope (Zeiss). 

Total RNA was isolated using a previously described method** Trace residual 
DNA was removed by two treatments using QIAGEN RNase-free DNase, each 
followed by purification using an RNeasy spin column (QIAGEN). Reverse tran- 
scription was performed using SuperScript II or SuperScript III (Invitrogen), using 
50-1,000 ng total RNA with 200 ng random hexamer primers. 5’ RACE was per- 
formed using the 2nd Generation 5'/3’ RACE kit (Roche), following the manu- 
facturer’s instructions, using the gene-specific antisense primers 51Ral (5'-TCGT 
CGTCGCATAGCACTTG-3’), 51Ra2 (5’-ATCGTGGAATGGCGGAACTC-3’) 
and 51Ra3 (5'-TCTTCCTGCATCGGAAAGTACG-3""). 

Mutagenesis. Mutant strains were constructed as previously described”. For dis- 
ruption of the sMMO, DNA fragments A and B (upstream and downstream of mmoX) 
were amplified by PCR using the primers 1262Af (5'-GATCGAGCTCCGACAC 
GGAAACAACCTATC-3’) and 1262Ar (5’-GATCACGCGTTTCGTCGCGGT 
GCTTAATGC-3’), and 1262Bf (5’-GATCCAATTGTCGCCGATCCGCTCGC 
AG-3’) and 1262Br (5'-GATCGAATTCCGATCGAGCGCACAGCTCC-3’) (restric- 
tion sites underlined). The products were cloned into pCR2.1 TOPO that had been 
excised with SacI/Mlul and MunI/EcoRI and then ligated sequentially into pCM184 
(ref. 35) that had been digested with the same enzymes. The linear SacI fragment 
was introduced into M. silvestris by electroporation, and deletion strains were selected 
on plates containing kanamycin. The kanamycin cassette was subsequently removed 
by expression of the Cre recombinase following introduction of plasmid pCM157 
(ref. 35). The same procedure was followed for the deletion of prmA, using the pri- 
mers 1651Af (5’-GATCGAGCTCTAGTCGGCTACGGCTATTATGG-3’) and 
1651Ar (5’-GAGAACGCGTGGCGCCTAACGAACTTTCTTTG-3’), and 1651Bf 
(5'-GATCGGTACCTCATGGGAGGCGATGGATTG-3’) and 1651Br (5’-GTC 
CGCTGACGGTGACTTTG-3’), except that the B fragment was excised from 
pCR2.1 TOPO using KpnI/EcoRI and ligated into these sites of pCM184. The inten- 
ded gene deletions were verified by PCR using the primers 1262Tf (5'-CCCAGTT 
CCATTCGTAAGAC-3’) and 1262Tr (5’-GTATTCGCTGAACAGCAAGG-3’), 
and 1651Tf(5'-AAGGCCGCGTCCGATACAAG-3’) and 1651Tr (5’-CAGAAC 
AAATCGGCCTGGGTCC-3’), which are located outside the cloned regions, and 
by sequencing. 

Proteomic analyses. Cells were lysed for proteomic analysis by three passages 
through a French pressure cell press (American Instrument Company) at 110 MPa 
(on ice). Cell debris was removed by centrifugation (10,000 g, 15 min, 4 °C). Proteins 
were separated by SDS-PAGE, and bands of interest were excised from the gels for 
the identification of polypeptides by the Biological Mass Spectrometry and Prote- 
omics Facility in the School of Life Sciences, University of Warwick. Coomassie-blue- 
stained gel pieces were processed and tryptically digested using the manufacturer’s 
recommended protocol on the MassPREP robotic protein handling system. The ex- 
tracted peptides from each sample were analysed by nano liquid-chromatography 
electrospray-ionization tandem mass spectrometry (LC-ESI-MS/MS) using Nano 
Acquity/Q-Tof Ultima Global instrumentation (Waters) with a 45-min liquid chro- 
matography gradient. All MS and MS/MS data were corrected for mass drift using 
reference data collected from human [Glu’]-fibrinopeptide B (catalogue F3261, 
Sigma) sampled each minute of data collection. The data were used to interrogate 
an M. silvestris database appended with the common Repository of Adventitious 
Proteins sequences (http://www.thegpm.org/cRAP/index.html) using the ProteinLynx 
Global Server v2.4, as previously described**. Polypeptides were identified based 
on a minimum of two peptides. 

Activity assays. A Clark-type oxygen electrode equipped with an OXY040A cell 
connected to a Digital Model 10 controller (Rank Brothers) was used to detect 
substrate-induced oxygen consumption by whole cells, in a 3 ml working volume. 
Between 1 mg and 5 mg dry weight of cell suspension was added to 3 ml oxyge- 
nated 40 mM phosphate buffer (pH 5.5) in the instrument cell maintained at 25 °C 
using a circulating water bath. The instrument was operated and calibrated by 
comparison with air-saturated water as described previously”’. The substrate (7.5- 
15,000 nmol) was added, and the substrate-induced rate was calculated by sub- 
tracting the endogenous rate from the rate following the addition of substrate. 
Gaseous substrates were prepared as saturated aqueous solutions in 120 ml serum 
vials containing 25 ml water flushed with at least ten volumes of the substrate gas, 
and the concentration was calculated using the Henry’s Law constants”. Quino- 
protein alcohol dehydrogenase was assayed using the artificial electron acceptor 
phenazine methosulphate (PMS) coupled to reduction of dichlorophenolindophenol 
(DCPIP), as described previously**. Reactions (1 ml) contained 100 mM Tris buffer 
(pH 9.0), 1 mM PMS, 0.08 mM DCPIP, 15 mM NH4,Cl, 20 pg protein and 10 mM 
substrate. Reactions were initiated with the addition of ammonium and followed 
spectrophotometrically at 600 nm. 


©2014 Macmillan Publishers Limited. All rights reserved 


Statistical analysis. The differences in growth rate, culture density and carbon con- 
version efficiency shown in Fig. 1 were evaluated using single factor analysis of variance 
(ANOVA) followed by Tukey’s honest significant difference (HSD) method. To 
identify significant differences in Fig. 3, Student’s t-test (two-tailed, two samples, 
assuming equal variance) was applied to the two growth conditions, using Micro- 
soft Excel. 
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Extended Data Figure 1 | Methylocella silvestris grown on propane 2-propanol and acetone was observed once again. The cells and medium were 
(20% (v/v) propane in air) in a fermenter (period between 62 and 106 days _ removed, and fresh medium was added on day 99. These data demonstrate 
shown). The 2-propanol and acetone in the culture medium reached that 2-propanol and acetone result from propane oxidation and that cells 


approximately 16 mM and 9 mM, respectively. When the propane supply was _ growing on propane in the presence of these intermediates can metabolize them 
shut off (days 92-96), the 2-propanol and acetone concentrations decreased to _ without an appreciable lag phase. The 2-propanol and acetone concentrations 
nearly zero, whereas the culture density continued to increase without show the mean + s.d. of triplicate measurements. 

interruption. Following the resumption of the propane supply, accumulation of 
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Extended Data Figure 2 | Phylogeny of SDIMOs. a, Phylogenetic 
relationships between the two M. silvestris SDIMOs (underlined) and other 
representative enzymes. The tree, constructed using the maximum likelihood 
method, is based on an alignment of amino acid sequences of the «-subunit 
of the hydroxylases. The sequences were aligned using Clustal; positions 
containing gaps or missing data were eliminated; and the tree was constructed 
with a final data set of 356 amino acids using MEGA 5 (ref. 32). Bootstrap 
values (based on 500 replications) greater than 95% are shown as filled circles 
at nodes, and those between 75% and 95% as open circles. The SDIMO 
subgroups”””” are indicated on the right of the figure. GenBank accession 
numbers (in order from the top): AAC45289.1, ABD46892.1, ZP_06887019.1, 
ABD46898.1, CAD30366.1, YP_002361593.1, CAD88243.1, BAE86875.1, 
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YP_113659.1, BAA84751.1, BAJ17645.1, AAM19727.1, BAF34294.1, 
ACZ56324.1, AAO48576.1, YP_919254.1, BAA07114.1, AAS19484.1, 
CAC10506.1, YP_700435.1, BAF34308.1, BAD03956.2, YP_002361961.1, 
YP_001834443.1, YP_001020147.1, NP_770317.1, YP_352924.1, AAL50373.1, 
P19732.1, AAT40431.1, YP_001409304.1 and CAB55825.1. b, The propane 
monooxygenase gene cluster (shown in red). The structural genes (hydroxylase 
a.-subunit, reductase, hydroxylase B-subunit and coupling protein) are 
followed by those encoding a putative chaperone prmG (ACK50595.1) and 
regulatory protein prmR (ACK50594.1), with homology to mmoG and mmoR 
of sMMO. The putative promoter sequence is shown above in relation to the 
ATG start codon, together with the consensus sequence of o** promoters as 
described previously*’. 
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Extended Data Figure 3 | M. silvestris polypeptide profiles. a, An extracts. b, Polypeptide identifications of the gel bands shown in a. For each 
SDS-PAGE gel loaded with soluble extract from wild-type cells grown on band, the four most abundant polypeptides are shown, except where fewer than 
methane (M), propane (P), succinate (S), 2-propanol (2-P) or acetone (A). four were detected. In addition, all sMMO-related and PrMO-related 
Prominent bands (identified with boxes in the right-hand photograph of the _ polypeptides identified (irrespective of the number of peptides used for 
same gel) evident following growth on methane or propane, but not on identification) are included. Otherwise, polypeptides identified with at least 
succinate, were excised from the gel and analysed by mass spectrometry. three peptides are included (except for the succinate lane; four peptides). Other 


The polypeptide identifications are shown in b. The data show that both sMMO _ identified polypeptides are not shown. The number of peptides used for the 
and PrMO were expressed during growth on propane but that PrMO subunits _identification of each polypeptide is shown. The total number of peptides 
were not expressed at a high level during growth on methane. PrMO subunits, detected from all of the polypeptides identified in each band is shown for 
but not sMMO subunits, were expressed during growth on 2-propanol. In comparison. DH, dehydrogenase; MM, theoretical molecular mass; PEP, 
addition, gel-free analysis of the complete soluble proteome” did not result in phosphoenolpyruvate. 

the detection of PrMO polypeptides in succinate-grown or methane-grown cell 
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Extended Data Figure 4 | Growth of M. silvestris strains. a, Growth of wild-type M. silvestris on a mixture of methane (red, crosses) and propane 
wild-type M. silvestris (solid lines, filled symbols) and the APrmA strain (purple, triangles) (2.5% (v/v) each). The inoculum for the cultures was grown 
(dashed lines, open symbols) on methane (circles) or propane (triangles) (2.5% on methane and propane (2.5% (v/v) each). Data points show the mean + s.d. 
(v/v)). The inset shows the substrate carbon conversion efficiency (CCE) of triplicate vials. c, d, Growth of and consumption of methane and propane 


(mg of cellular protein formed per mg of substrate carbon consumed) for the _ by wild-type M. silvestris (c) and the AMmoxX strain (d) when supplied with an 
wild-type M. silvestris (dark grey) and the APrmA strain (light grey) during approximately 1:10 (v/v) ratio of gases, showing that PrMO did not oxidize 

growth on methane (M) or propane (P). Data are the mean + s.d. for three appreciable amounts of methane. Methane concentrations are shown as % (v/v) 
(methane) or two (propane) vials. b, Growth and substrate consumption of X 10 on the secondary (right) y axes. Data are the mean + s.d. of triplicate vials. 
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Extended Data Figure 5 | Methane and propane oxidation kinetics. 
a, Maximum methane-induced and propane-induced specific oxygen 
consumption rate (nmol min’ ' per mg dry weight) of whole M. silvestris cells 
grown on methane, propane or succinate, as determined by oxygen electrode 
studies, with approximately 200 1M oxidation substrate. Data are the 


[Substrate] (UM) 


mean = s.d. of three measurements. b, c, Kinetics of methane-induced (b) or 
propane-induced (c) oxygen consumption in whole cells grown on that 
substrate. The Hanes—Woolf plots show substrate concentration (S) divided by 
oxygen consumption rate (v) as a function of S, and the trend line cuts the x axis 
at —Kim. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a b 8 
2.5 O methane s O methane 
Cell growth Cell growth 
BH propane @ propane 
substrate ; substrate : 
7) @ succinate L B® succinate 
£ © —~ 60 
5 55 
wn ‘ no} 
$15 & oo 
s 5 : 40 
fe) c 
= c's 
=. 4 8 = 
2 Gf 
[e) We 
E = — 20 
0.5 : °|@ lt fo) 
foo) © = 
a 
) fe) 
Assay substrate Assay substrate 
Cc 
Assay substrate 
Growth substrate 
2-propanol 1- propanol 
methane 63 +15 871+13 
propane 381 +30 363 +7 


Extended Data Figure 6 | Wild-type M. silvestris activity assays. 

a, Stoichiometry of whole-cell oxygen consumption in response to addition of 
100-250 nmol methanol, 1-propanol or propanal. Cells from all growth 
conditions consumed approximately equimolar amounts of oxygen when 
methanol was added; however, when 1-propanol or propanal replaced 
methanol, the amount of oxygen consumed by propane-grown cells increased 
to threefold (1-propanol) or fivefold (propanal) that of methane-grown or 
succinate-grown cells. These data suggest that the oxidation of terminal 
intermediates proceeds further in propane-grown cells, although enzymes that 
can oxidize 1-propanol and propanal may be present in all cell types, as shown 
in b and c. Data are the mean + s.d. of the number of measurements shown. 
b, Oxygen uptake rates of M. silvestris cells grown on methane, propane or 
succinate in response to the addition of the substrates shown. High rates of 
oxygen consumption were recorded in response to methanol, 1-propanol and 
propanal in cells grown on either methane, propane or succinate. However, the 
oxygen consumption rates in response to 2-propanol, acetone and acetol were 


at least twofold, fourfold or sevenfold higher, respectively, in propane-grown 
cells than in methane-grown or succinate-grown cells, demonstrating that the 
ability to oxidize intermediates of the sub-terminal oxidation pathway was 
induced during growth on propane. Substrates were used at a final 
concentration of 5 mM. Data are the mean ~ s.d. of three measurements 
(except for methane/methanol, n = 7). c, Quinoprotein alcohol dehydrogenase 
activity (mean + s.d. of three measurements) assayed as DCPIP reduction 
(nmol min“! per mg protein) in soluble extract from methane-grown or 
propane-grown cells. The activity in cell extracts from both growth conditions 
was high when 1-propanol was the substrate (probably as a result of the 
constitutive expression of methanol dehydrogenase); however, with 2-propanol 
as the substrate, the activity was sixfold higher in the extract from cells grown 
on propane than cells grown on methane. By contrast, the rates of NAD*- 
linked or NADP*-linked 1-propanol or 2-propanol dehydrogenase activity 
were less than 10 nmol min’! per mg protein in cell extracts from each growth 
condition (data not shown). 
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Extended Data Figure 7 | 1-Propanol-metabolizing ability was induced in _not inhibit growth on succinate, suggesting that 1-propanol-metabolizing 


propane-grown cells. a, When vials were inoculated with methane-grown potential was induced in cells grown on propane and could be maintained 
cells, growth was possible on 2-propanol but not on 1-propanol at any during growth on, or in the presence of, 1-propanol. The concentrations used 
concentration tested. b, In addition, 1-propanol completely inhibited growth — were 0.05% (w/v) 1-propanol, 0.05% (v/v) 2-propanol, 3 mM succinate (except 
on 2-propanol. c, When using succinate-grown inoculum, 1-propanol also d, 5 mM), or as indicated. Data are the mean + s.d. of duplicate (a, c d) or 


greatly inhibited growth on succinate. d, However, when using an inoculum triplicate (b) vials. 
grown on propane, limited growth occurred on 1-propanol, and 1-propanol did 
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Extended Data Figure 8 | Detection of oxidation products in wild-type 

M. silvestris cultures. a, During growth on 4% (v/v) propane, 2-propanol 
was detected in the culture medium during the mid-exponential and late 
exponential phase, before declining from a maximum of approximately 0.5 mM 
at 210h to below the limit of detection at stationary phase (300 h). The culture 
density (ODs49) is shown in black; propane concentration, in purple; and 
2-propanol concentration, in red; with solid lines and filled symbols. Control 
vials, which contained cells killed by autoclaving, are shown as dotted lines and 
open symbols. The 2-propanol concentration is shown as X4 on the secondary 


(right) y axis. Data are the mean + s.d. of triplicate vials. b, For the cultures 
shown in a, the amount of 2-propanol present at each time point (including the 
amount previously removed during sampling) is expressed as a percentage of 
the propane consumed. At 94h, 25% of the propane consumed could be 
accounted for as 2-propanol in the growth medium. This value therefore 
represents the minimum percentage of propane that is oxidized to 2-propanol, 
because no allowance has been made for the consumption of 2-propanol by the 
cultures. Data are calculated from the mean + s.d. of triplicate vials. 
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Extended Data Figure 9 | Oxidation products of the APrmA strain and 
growth rates of the APrmA, wild-type and inhibited wild-type strains. 

a, Culture density (black lines, triangles) and 2-propanol (red lines, circles) and 
acetone (purple lines, diamonds) concentrations during growth of the APrmA 
strain on 4% (v/v) propane. This strain accumulated 1.3 mM 2-propanol, but 
neither 2-propanol nor acetone were detected during growth of the AMmoX 
strain on 20% (v/v) propane (data not shown), suggesting that 2-propanol is 
one (or the major) product of propane oxidation by sMMO. b, Growth rate of 
wild-type M. silvestris (dark grey) and the APrmA strain (light grey) on the 
substrates shown. Disruption of PrMO decreased growth on 2-propanol 
compared with the wild-type, although growth on acetone, acetate, methanol or 


ethanol was not significantly affected, implicating PrMO in 2-propanol 
metabolism. Data are the mean + s.d. of three vials (except 2-propanol, n = 6 
(APrmA strain) or n = 5 (wild-type)). Significance was determined by 
Student’s t-test (**, P< 0.01). c, Growth of wild-type M. silvestris on acetone 
(squares), 2-propanol (circles) or propane (10% (v/v)) (triangles) either without 
an inhibitor (filled symbols, solid lines) or with acetylene (2% (v/v)) as an 
inhibitor (open symbols, dashed lines). Data are the mean + s.d. of triplicate 
vials. For 2-propanol, the specific growth rates in the early exponential phase 
were 0.020 + 0.0002 h~! and 0.018 + 0.0001 h_! (mean = s.d.) for uninhibited 
or inhibited cultures, respectively. 
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Substrate Concentration Growth 
Formate 1mM - 
Formate 5 mM, 40 mM + 
Formate 25 mM ++ 
H2/CO>2 20% O2, 0.5 to 10% CO2, balance Hz - 
Methane 2.5% to 20% ++ 
Methanol 0.05% ++ 
Methylamine 5mM ++ 
Acetate 5mM ++ 
Ethane 20% + 
Ethene 10% - 
Ethanol 0.05% ++ 
Glycine 5mM - 
Glycollate 5mM - 
Glyoxylate 1 mM, 5mM - 
Oxalate 1mM, 5mM - 
Urea 5mM - 
1,2-propanediol 0.05% ++ 
1-propanol 0.01 to 0.1% -* 
2-propanol 0.05% ++ 
Acetol 0.05% ++ 
Acetone 0.05% ++ 
Glycerol 5mM ++ 
Methyl acetate 0.05% + 
Propane 2.5% to 20% ++ 
Propene 10% - 
Propionate 5mM ++ 
Propionate 20 mM - 
Pyruvate 5mM ++ 
Butane 20% - 
Trans-2-butene 10% - 
Malate 5mM ++ 
Succinate 5mM ++ 
Tetrahydrofuran 0.05% + 
Pentane 0.1% - 
Gluconate 10 mM ++ 
Pentadecane 0.1% - 


Extended Data Table 1 | Substrate utilization by Methylocella silvestris 


LETTER 


Growth is indicated as: ++, growth to ODsqo > 0.25; +, growth to ODsq4o = 0.08-0.25; —, growth to ODsq4o < 0.08. (Concentrations in % are given as v/v.) 
* Growth on 1-propanol was dependent on the inoculum used for the cultures. 
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T-helper-17 (T}17) cells have critical roles in mucosal defence and 
in autoimmune disease pathogenesis’ *. They are most abundant 
in the small intestine lamina propria, where their presence requires 
colonization of mice with microbiota* ’. Segmented filamentous bac- 
teria (SFB) are sufficient to induce T};17 cells and to promote Ty17- 
dependent autoimmune disease in animal models**. However, the 
specificity of T};17 cells, the mechanism of their induction by distinct 
bacteria, and the means by which they foster tissue-specific inflam- 
mation remain unknown. Here we show that the T-cell antigen recep- 
tor (TCR) repertoire of intestinal T};17 cells in SFB-colonized mice 
has minimal overlap with that of other intestinal CD4* T cells and 
that most T}17 cells, but not other T cells, recognize antigens encoded 
by SEB. T cells with antigen receptors specific for SFB-encoded pep- 
tides differentiated into RORyt-expressing T}17 cells, even if SFB- 
colonized mice also harboured a strong Ty1 cell inducer, Listeria 
monocytogenes, in their intestine. The match of T-cell effector func- 
tion with antigen specificity is thus determined by the type of bacteria 
that produce the antigen. These findings have significant impli- 
cations for understanding how commensal microbiota contribute 
to organ-specific autoimmunity and for developing novel mucosal 
vaccines. 

How SEB induces T};17 cells and how these cells contribute to self- 
reactive pathological responses remain key unanswered questions. A 
recent study, using mice with monoclonal TCRs, suggested that induc- 
tion of Tj17 cells by SFB or other microbiota is independent of cognate 
antigen recognition’’. To further evaluate mucosal effector T-cell in- 
duction in a physiological setting, we undertook an examination of the 
repertoire and specificity of naturally arising Tj17 cells. To facilitate 
analysing live Ty17 cells, we used 1123r“"” reporter mice’, as among 
CD4* T cells, only this subset expresses IL-23R. We first asked if small 
intestine lamina propria (SILP) T};17 cells are in general responsive to 
gut luminal commensal antigens. GFP* (Ty17) and GEP” (non-Ty;17) 
CD4°* T cells, purified from 1123r°F"’/* C57BL/6 (B6) mice that had 
been colonized with SFB, were incubated with splenic antigen-presenting 
cells (APCs) and autoclaved small intestinal luminal content of mice 
from the Jackson laboratory (Jackson) and Taconic Farms (Taconic). 
We used the measure of forward scatter (FSC) as a surrogate readout 
for T-cell activation. Intriguingly, only Ty17 cells mounted a detectable 
response to Taconic antigens (Extended Data Fig. 1a). SFB is one of the 
bacteria unique to Taconic flora*. Thus we repeated the assay with faecal 
material from SFB-monoassociated mice (SFB-mono antigens) and de- 
tected a robust response only among GFP” cells (Fig. 1a). These cells 
did not respond to major histocompatibility complex class II (MHCII)- 
deficient APCs loaded with SFB-mono antigens, indicating that the acti- 
vation was dependent on antigen presentation (Extended Data Fig. 1b). 
SFB-mono antigens selectively stimulated total CD4* T cells from B6 
Taconic mice, but not those from B6 Jackson mice, consistent with 


in vivo priming of SFB-specific T}417 cells (Fig. 1b), and any bystander 
effect in this assay was negligible (Extended Data Fig. 1c). Next, we used 
an IL-17A ELISPOT assay to quantify the percentage of T};17 cells from 
SFB-colonized mice responding to commensal antigens. GFP” cells had 
a relatively weak response towards Jackson antigens, but had a robust res- 
ponse towards Taconic antigens. Significantly, SFB mono-associated 
mouse faecal antigens stimulated over 60% of the T}717 cells (Fig. 1c). 
In contrast, there was no response of T};17 cells to faecal material from 
germ-free mice (data not shown). Thus, the majority of T}17 cells in 
the SILP of SFB-colonized mice react with SFB-derived antigens, whereas 
a small proportion respond to non-SFB antigen, indicating that most 
TyI7 cells are specific for bacteria in the intestinal lumen. 

We wished to compare the TCR repertoires of T};17 cells and those 
of non-T}17 cells. Using antibodies against a panel of TCR Vs, we 
observed a higher proportion of VB14~ T cells in Ty17 cells than in 
non-T};17 cells from the SILP (Extended Data Fig. 2a, b). This bias was 
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Figure 1 | Intestinal T,17 cells are specific for SFB- and other microbiota- 
derived antigens. a, Selective activation of intestinal GEP* CD4* T cells from 
1123r%""’* mice by faecal extract from SFB-monoassociated mice. Forward 
scatter (FSC) was evaluated after 2 days. b, Activation of SILP CD4* T cells 
from B6 Taconic mice and B6 Jackson mice with faecal extract from SFB- 
monoassociated mice. Jax, Jackson mice; Tac, Taconic mice. c, IL-17A 
ELISPOT assay of intestinal GEP* CD4* T cells from SFB-colonized 
1123re""’* mice treated with indicated stimuli. Left, representative ELISPOT 
images. Right, compilation of results from multiple animals. Each symbol 
represents cells from a separate animal. 
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Figure 2 | Most Ty17 TCR hybridomas recognize SFB-unique proteins. 
a, Responses of the TCR hybridomas, prepared from Ty17 and non-Ty17 
intestinal CD4* T cells, to faecal material from SFB-monoassociated mice. 
b, Responses of the TCR hybridomas to Escherichia coli clones expressing 
full-length SFBNYU_003340 and SFBNYU_004990. Note that a non-T}y17 TCR 


recapitulated when the CD4* T cells were stained with antibodies specific 
for RORyt and IL-17A, two other characteristic markers of Ty17 cells 
(Extended Data Fig. 2c). However, intracellular staining for IFNy and 
FOXP3 indicated no VB14* cell biasamong Ty] and T regulatory cells 
(Extended Data Fig. 2c). To determine if the VB14 enrichment of T}17 
cells is influenced by microbiota, we compared SFB-free B6 Jackson 
mice with SFB-colonized B6 Taconic mice. The Jackson mice had few 
RORyt* Ty17 cells, and there was no enrichment of VB14* cells 
among them. In contrast, Jackson mice cohoused with Taconic mice 
had increased numbers of lamina propria T};17 cells, which were 
enriched for VB14* TCRs (Extended Data Fig. 2d), indicating that 
the T},;17 repertoire is shaped by specific microbiota. 

We chose to focus on VB14" cells to further elucidate the gut CD4* 
T-cell repertoire. First, we used pyrosequencing to examine the rep- 
ertoire of VB14* SILP Ty17 and non-Ty17 cells from SFB-colonized 
mice. The complementarity-determining region 3 (CDR3) of VB14 was 
characterized for each cell population from eight 1123r°'"”* mice. Each 
sample contained a minimum of several hundred unique CDR3 sequences 
(Extended Data Fig. 3a). Interestingly, the ten most frequently used 
unique CDR3 sequences accounted for 60% of the T}17 and only 40% 
of the non-T}17 repertoire (Extended Data Fig. 3b). Furthermore, the 
dominant CDR3 sequences in individual mice exhibited a clear bias 
towards either T}17 or non-T}17 cells (Supplementary Table 1). Many 
of these CDR3 sequences were shared between mice and were enriched 
either in Ty17 or in non-Ty17 cells in individual mice (Extended Data 
Fig. 3c). 

The finding that intestinal T}17 cells have a distinct repertoire 
prompted us to further determine their antigen specificity. Thus, we sorted 
single T cells from four mice and sequenced their VB14 and paired Va 
chains (Extended Data Fig. 4a). Notably, each mouse carried some VB14 
sequences that were present in several sorted cells, and these sequences 
strongly biased towards Ty17 or non-Ty17 cells (Extended Data Fig. 4b), 
corroborating our findings from the high-throughput sequencing 


hybridoma also responded to the clone expressing SFBNYU_004990. 

c, Summary of the nineteen dominant clonotypic TCR clones. Ten T}y17-biased 
clones are highlighted in green, and eight non-T};17-biased clones are 
highlighted in red. d, Features of the two antigenic proteins of SFB. FPKM, 
fragments per kilobase of transcript per million mapped reads. 


analysis. To define the antigen specificity of the TCRs from intestinal 
Ty17 and non-Ty717 cell clones, we expressed a cohort of nineteen pre- 
dominant clonotypic TCRs (ten Ty17 clones, eight non-T};17 clones, 
and one neutral clone) ina NFAT-GFP* hybridoma that can report on 
TCR signalling’”. Upon co-culture of the hybridomas with splenic APCs 
and heat-inactivated mouse intestinal luminal content, several T}17- 
TCR hybridomas, but not non-T};17-TCR hybridomas, responded to 
Taconic antigens, and not to Jackson antigens (Extended Data Fig. 4c). 
Furthermore, when SFB-mono antigens were used, we detected res- 
ponses from 7/10 T};17-TCRand the neutral TCR hybridoma, but none 
of the non-T};17 cell hybridomas (Fig. 2a). These responses were abro- 
gated if the APCs were from MHCII-deficient mice (Extended Data Fig. 4d). 

We next sought to identify epitopes recognized by Ty17 cell TCRs 
using a whole-genome shotgun cloning and expression screen, an unbi- 
ased approach previously used to identify T-cell antigens from other 
bacteria'* (Extended Data Fig. 5a). One bacterial clone, designated 3F12- 
E8, stimulated 7B8 and four other Ty17-TCR hybridomas (Extended 
Data Fig. 5b-d). Based on the recent annotation of the SFB genome’, 
we assigned the 672-base pair 3F12-E8 insert to an SFB gene (SFBNYU_ 
003340"). We confirmed the specificity by cloning the full-length gene 
and demonstrating that its product stimulated the aforementioned five 
TCRs, but not any other TCRs (Fig. 2b, left). We further mapped a min- 
imal epitope that stimulated all five TCRs and a shorter 8-amino-acid 
epitope that stimulated only the 7B8 and 2A6 hybridomas (Extended 
Data Fig. 5e). 

Another expression screen was performed using the 1A7 hybridoma, 
which along with three other TCRs formed a distinct cluster with an 
identical Vo and highly similar VB14 CDR3 sequences (Extended Data 
Fig. 6a). A stimulatory clone, designated 2D10-A10 (Extended Data 
Fig. 6b, c), contained the amino-terminal sequence of another SFB gene 
(SFBNYU_004990"’). We mapped the epitope for the 1A7 hybridoma 
to 9 amino acids (Extended Data Fig. 6d). Both the full-length gene pro- 
duct and a 9-amino-acid peptide stimulated all four TCRs, indicating that 
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these TCRs indeed recognize the same epitope (Fig. 2b, right). However, 
the single TCR derived from non-Ty17 cells (3F4) displayed a much 
weaker dose-response to peptide antigen than the other TCRs (Extended 
Data Fig. 6e). 

Thus, eight out of eleven VB14~ T,417-TCR hybridomas recognized 
two distinct antigens encoded by SFB (Fig. 2c). Both proteins are unique 
to SFB, expressed at a medium to high level, and predicted to be secreted 
or at the cell surface (Fig. 2d). Importantly, primary VB14* Ty17 cells 
responded to the two immunodominant SFB epitopes (Extended Data 
Fig. 7a). Although VB14 * cells consistently responded slightly better, 
VB14 Ty17 cells were also stimulated by SFB (Extended Data Fig. 7b), 
indicating that these cells respond to other SFB epitopes. An in silico 
search was conducted for potential epitopes within the SFB proteome 
(Extended Data Fig. 7c, d), which yielded several more stimulatory pep- 
tides (Extended Data Fig. 7e). Among these, peptide N5, also derived 
from SFBNYU_003340, was a strong stimulator of intestinal Tj17 cells, 
activating both VB14" cells and VB14_ cells (Extended Data Fig. 7f). 
Thus, in the small intestine, SFB is the dominant antigen source for poly- 
clonal T};17 cells, but for few, if any, non-T}17 cells. 

Wethen asked what happens to T cells expressing SFB-specific TCRs. 
We generated 7B8 TCR transgenic mice (7B8Tg)”, and transferred 
naive T cells from these mice into isotype-marked congenic B6 mice”. 
After one week, we readily detected donor-derived T cells in the SILP 
of mice that had been exposed to SFB, whereas they were completely 
absent in SFB-deficient recipients (Extended Data Fig. 8a). Remark- 
ably, almost all donor-derived cells became positive for RORyt (Fig. 3a). 
Similar results were obtained upon transfer of T cells from two other TCR 
transgenic strains (1A2Tg and 5A11Tg) into SFB-colonized recipient mice 
(Extended Data Fig. 8b). The donor-derived T cells lacked expression of 
the transcription factors associated with alternative CD4 T-cell programs 
(for example, FOXP3, GATA3, and T-bet) (Extended Data Fig. 8c). 

To visualize endogenous SFB-antigen-specific T cells, we produced 
MHCII-tetramers containing peptide A6 from SFBNYU_003340 (3340- 
A6 tetramer)”. The I-A°/3340-A6 tetramer specifically stained GFP‘ 
SILP CD4* T cells from SFB-colonized 1123r°""’* mice (Extended 
Data Fig. 8d). Furthermore, a sizable population of I-A°/3340-A6 tetramer- 
positive cells was present in B6 Taconic, but not in B6 Jackson mice (Ex- 
tended Data Fig. 8e), and these cells were uniformly ROR}t-positive, 
indicating that they were SFB-elicited T},17 cells (Fig. 3b). 

We next aimed to determine whether polarization of the antigen- 
specific T};17 cells in response to SFB colonization is dictated by the nature 
of the antigenic protein or properties of the microbe. Listeria mono- 
cytogenes, an enteric pathogenic bacterium that also colonizes the small 
intestine, typically elicits a Tyy1 response”. Mice were orally infected 
with L. monocytogenes expressing SFBNY U-003340 (Listeria-3340) (Ex- 
tended Data Fig. 8f) or SFB before intravenous transfer of 7B8Tg T 
cells. 7B8Tg T cells accumulated in the SILP of both sets of mice, but, 
importantly, they expressed T-bet rather than RORyt when the hosts 
were colonized with Listeria-3340 (Fig. 3c). 

To further investigate a relationship between the fate of SILP T helper 
cells and the bacterial origins of antigens, we transferred 7B8Tg T cells 
into mice that were colonized with both SFB and Listeria and simul- 
taneously tracked CD4* T-cell responses specific for both bacteria in 
the SILP using the Ly5.1* congenic marker for 7B8T¢ cells and lister- 
iolysin O (LLO)-tetramers that stain endogenous Listeria-specific T cells 
derived from the host (Extended Data Fig. 9a). In the presence of both 
Ty17- and Ty1-inducing bacteria, 7B8T¢ T cells expressed RORYt, but 
not T-bet, whereas LLO-tetramer~ cells expressed T-bet, but not RORyt 
(Fig. 4a and Extended Data Fig. 9b, c). This result is in contrast to the 
Ty polarization of TCR transgenic T cells specific for the commensal 
CBirl flagellin antigen observed upon infection with the protozoan par- 
asite Toxoplasma gondii**, a T};1-inducing intestinal pathogen. This 
suggests that, unlike CBirl-encoding Clostridia, SFB has the ability to 
direct a dominant signal specialized for induction of Ty17 cells. 

SFB colonization of the small intestine is potentially beneficial, atten- 
uating pathogenic bacteria-induced colitis’, but it can also trigger or 
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Figure 3 | SFB-specific T cells become Ty17 cells in the SILP. a, 7B8T¢ cells 
(Ly5.2) were transferred into SFB-colonized mice (Ly5.1), and SILP T cells were 
analysed after 8-15 days. Left, representative FACS plots. Right, analysis of 
multiple animals (one symbol per animal). b, I-A°/3340-A6 tetramer stain of 
SILP T cells from SFB-colonized B6 mice. Left, representative FACS plots. 
Right, analysis of multiple animals (one symbol per animal). c, 7B8Tg cells 
(Ly5.1) were transferred into Ly5.2 congenic hosts orally colonized with 
Listeria-3340 or SFB. Seven days after transfer, donor-derived cells in the SILP 
were analysed. Analyses of cells from different mice are shown in the composite 
FACS plot. The results are representative of three experiments. 


exacerbate systemic autoimmune disease’®”’, raising the question as to 
whether SFB-specific T}17 cells can circulate beyond the small intest- 
ine. We examined the colons and spleens of SFB-positive recipients of 
7B8T¢g naive T cells, and found these cells in both organs. Importantly, 
more than 80% of these SFB-specific T cells in colon and 40% in spleen 
expressed RORyt (Fig. 4b). Consistent with this result, staining of endog- 
enous T cells from Taconic mice revealed 3340-A6 tetramer-positive cells 
in the large intestine and most of these cells expressed RORyt (Ex- 
tended Data Fig. 10a, b). 

Our results therefore indicate that intestinal antigen-specific CD4* 
T cells differentiate to become either Ty;1 or Ty17 cells, depending on 
which luminal bacterium delivers the antigen. We propose a determin- 
istic model for T helper cell differentiation whereby the bacterial con- 
text of cognate antigen delivery dictates the fate of the antigen-specific 
T cells (Fig. 4c). Our work opens the way towards elucidating the mech- 
anisms of T}y17 cell induction by microbiota and of how gut-induced 
Ty17 cells can contribute to distal organ-specific autoimmune disease. 
In addition, it serves as a guide for future studies of human commensal- 
specific pro-inflammatory T cells that are thought to contribute to auto- 
immune diseases such as rheumatoid arthritis”. Finally, the demonstration 
of controlled polarized T-cell responses towards commensal bacteria 
offers the potential for novel approaches towards mucosal vaccination. 
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METHODS SUMMARY 

Mice. All mice were housed in the animal facility of The Skirball Institute of Bio- 
molecular Medicine at the New York University School of Medicine. Experimental 
protocols were approved by the Institutional Animal Care and Use Committee. C57BL/ 
6 mice were purchased from Taconic Farm (B6 Taconic) or the Jackson Laboratory 
(B6 Jackson). 112370"? mice", a gift from M. Oukka, were maintained by breeding 
with B6 Taconic mice. 7B8Tg, 1A2Tg and 5A11Tg SFB-specific TCR transgenic 
(Tg) mice were generated as previously described” and kept with SFB-minus flora. 
For adoptive transfer, naive Tg T cells (CD62L" cp44’° VB14~ CD4* CD3*) were 
sorted from the spleen and were injected intravenously into congenic recipient mice”. 
Generation of TCR hybridomas. Retroviruses carrying an expression cassette en- 
coding TCRa, TCRB, and CD4 were used to infect the NFAT-GFP 580 8 hy- 
bridoma cell line’. 

Construction and screen of whole-genome shotgun library of SFB. The shot- 
gun library was prepared with a procedure modified from a previous study'®. The 
library is estimated to contain 10* clones. The expression of exogenous proteins was 
induced by isopropylthiogalactoside for 4h. For antigen screening, pools of heat- 
killed bacteria (~30 clones per pool) were added to a co-culture of APCs and hybridomas. 
MHCII tetramer production and staining. MHCII/3340-A6 tetramer was pro- 
duced as previously described”’. SILP T cells were incubated at room temperature 
for 60 min with fluorochrome-labelled tetramer (10 nM) before staining with rel- 
evant antibodies at 4°C. 

Heterologous expression of SEBNYU_003340 in Listeria monocytogenes. The 
entire coding region of SFBNYU_003340, including its predicted signal sequence, 
was sub-cloned into the Listeria expression vector pIMK2”. The resultant plasmid 
was transformed into electrocompetent Listeria monocytogenes strain 10403S-inlA™ 
and plated on selective medium containing kanamycin (50 jig ml~!)**. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Mice. C57BL/6 mice were purchased from Taconic Farm (B6 Taconic) or Jackson 
Laboratory (B6 Jackson). 1123r°"? mice'® were provided by M. Oukka and main- 
tained by breeding with B6 Taconic mice. Ly5.1 mice (B6.SJL-Ptprca Pepcb/Boy]) 
and MHCII-deficient mice (B6.129S2-H2"“"!“*/)) were from Jackson Laboratory. 
Antibodies and flow cytometry. The following antibodies were from eBiosciences, 
BD Pharmingen or BioLegend: V2 (B20.6), VB3 (KJ25), VB4 (KT4), VB5 (MR9- 
4), VB6 (RR4-7), VB7 (TR310), VB8 (F23.1), VB8.1/8.2 (MRS-2), VB8.3 (SCL), 
VB10 (B21.5), VB11 (CTVB11), VB12 (MR11-1), VB14 (14-2), CD3 (145-2C11), 
CD4 (RM4-5), CD25 (PC61), Ly5.1 (A20), Ly5.2 (104), MHCII (M5/114), RORyt 
(AFKJS-9 or B2D), FOXP3 (FJK-16s), T-bet (eBio4B10), GATA3 (TWAJ), IL- 
17A (eBiol17B7) and IFN-y (XM61.2). Flow cytometric analysis was performed on 
an LSRII (BD Biosciences) or an Aria II (BD Biosciences) and analysed using FlowJo 
software (Tree Star). DAPI (Sigma) was used to exclude dead cells. 

T-cell preparation and staining. Small intestine lamina propria were minced and 
then incubated for 30 min at 37 °C with collagenase D (1 mg ml !; Roche), dispase 
(0.05 U ml}; Worthington) and DNase! (100 pg ml 1 Sigma). Lymphocytes were 
collected at the interface of a 40%/80% Percoll gradient (GE Healthcare). Cells were 
stained for surface markers, followed by fixation and permeabilization (eBioscience). 
Calculating enrichment scores. An enrichment score for a given VB in IL-23R 
(GEP)* cells is defined as the equation of (per cent of VB* cells in the GFP-positive 
fraction)/(per cent of VB" cells in the GFP-negative fraction). For example, for 
Extended Fig. 2b, VB14 enrichment was calculated as (7.45/(7.45 + 26.2))/(4.48/ 
(4.48 + 61.8)) or 3.3. Ascore > 1 means a positive enrichment anda score ~ 1 means 
no enrichment. 

High-throughput TCR sequencing. The SILP cells from 1/23r°""”* mice were 
stained for surface markers and VB14* CD4* T cells were sorted on the Aria II. 
For each sample, we collected about 2 X 10° cells (2.17 + 0.43 X 10* cells for GEP* 
Ty17 cells and 2.38 + 0.54 X 10* cells for GFP~ non-Ty,17 cells). Cells were lysed 
in TRIzol reagent (Invitrogen) and RNA was extracted following the manufacturer’s 
instruction. RNA precipitation was aided with GlycoBlue (Invitrogen). Comple- 
mentary DNAs were prepared with a reverse transcription kit (USB). VB14 PCRs 
were performed using barcoded oligonucleotides. PCR products from 16 samples 
were quantified on NanoDrop. Equal amounts of barcoded PCR product were mixed 
and sequenced using a 454 GS Junior system (Roche). The raw sequencing data 
was first aligned using the high-throughput analysis tool provided by IMGT”. We 
obtained 6,647 + 954 reads for T}17 cells and 5,573 + 889 reads for non-Ty17 
cells. CDR3 usage was further computed with Perl-based scripts developed in- 
house. The T};17 samples had 340-772 unique VB14 CDR3 sequences and the non- 
Ty17 samples had 849-2,148 unique VB14 CDR3 sequences. 

Single-cell TCR sequencing. The SILP cells from 1123r@”’* mice were stained for 
surface markers. GFP* and GFP" VB14* CD4* T cells were sorted on the BD 
Aria II and deposited at one cell per well into 96-well PCR plates preloaded with 
5 pl reverse transcription mix (USB). Immediately after sorting, whole plates were 
incubated at 50 °C for 60 min for cDNA synthesis. Half of the cDNA was used for 
VB14 PCR using forward primer 5'-ACGACCAATTCATCCTAAGCAC-3’ and 
reverse primer 5’-AAGCACACGAGGGTAGCCT-3’. To retrieve Va sequences, 
the other half of cDNA was pre-amplified for 16 cycles using a mix of twenty-one 
forward primers* (each modified by adding a 5’ extended anchor sequence: 
TAATACGACTCACTATAGGG) and a reverse primer 5'-CATGTCCAGCACA 
GTTTTGTCAGT-3’. The primary Va PCR products were diluted and subjected 
to a second round PCR using forward primer 5’-TAATACGACTCACTATAG 
GG-3’ and reverse primer 5'-GTCAAAGTCGGTGAACAGGC-3’. PCRs were per- 
formed in a LightCycler 480 (Roche). PCR products were cleaned up with ExoSap- 
IT reagent (USB) and Sanger sequencing was performed by Macrogen. In nearly 
all cases, for cells with the same VB14 sequence, we retrieved a single unique Va 
sequence, indicating that these cells were clonotypically identical. 

Generation of TCR hybridomas. The NFAT-GFP 58a 8 hybridoma cell line’” 
was provided by K. Murphy. To reconstitute TCRs, we used a self-cleavage se- 
quence of 2A to link cDNAs of TCRa and TCR generated from annealing of 
overlapping oligonucleotides (TCRo-p2A-TCR§) and shuttled the cassette into a 
modified MigR1 retrovector in which IRES-GFP was replaced with IRES-mCD4. 
Then retroviral vectors were transfected into Phoenix E packaging cells using Lipo- 
fectamine 2000 (Invitrogen). Hybridoma cells were transduced with viral super- 
natants in the presence of polybrene (8,1g ml‘) by spin infection for 90 min at 
32 °C. Transduction efficiencies were monitored by checking mCD3 surface expres- 
sion on day 2. We generated nineteen hybridomas for predominant clonotypic TCRs 
whose Va and VB sequences were retrieved from single-cell TCR sequencing (no. 
of VB = 6 for T};17 biased clones, and no. of VB = 4 for non-Ty17 biased clones, 
no. of Va = 3. Note that a single unique Va was identified for every VB). We also gen- 
erated the OTII hybridoma using TCR sequences provided by F. Carbone (chick- 
en ovalbumin antigen-specific, 1-A?-restricted). 
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Assay for hybridoma activation. To prepare antigen-presenting cells, splenocytes 
from B6 mice that were injected intraperitoneally with 8 x 10° FLT3-B16 mela- 
noma cells 10 days before were positively enriched for CD11c* cells using MACS 
LS columns (Miltenyi). 10* hybridoma cells were incubated with 2 x 10° APCs and 
autoclaved antigens from intestinal luminal contents or faecal material for two days. 
GFP induction in the hybridomas (CD3* fraction) was analysed by flow cytometry. 
Construction and screen of whole-genome shotgun library of SFB. The shot- 
gun library was prepared with a procedure modified from a previous study”®. In 
brief, genomic DNA was purified from the feces of SFB-monoassociated mice by 
phenol:choloroform extraction. DNA was subjected to whole-genome amplifica- 
tion with the REPLI-g kit (Qiagen) following the manufacturer’s instructions. Ampli- 
fied materials were partially digested with Sau3A (NEB), then ligated with the 
BamHI-linearized pGEX-4T1 expression vector (GE Healthcare). Ligation pro- 
ducts were introduced into competent Stbl3 cells (Invitrogen). To ensure the quality 
of the library, we sequenced the inserts of randomly picked colonies. All the se- 
quences were mapped to the SEB genome. The library is estimated to contain 10* 
clones. We grew bacteria in 96-well deepwell plates (VWR) with AirPort micro- 
porous cover (Qiagen). The expression of exogenous proteins was induced by iso- 
propylthiogalactoside for 4 h. Then bacteria were heat-killed by incubating at 70 °C 
for 1 h, and stored at —20 °C until use. For antigen screens, pools of bacterial clones 
(~30 clones per pool) were added to a co-culture of APCs and hybridomas. Clones 
within the positive pools were screened individually against the hybridoma bait. 
Finally, the inserts of positive clones were subjected to Sanger sequencing. The se- 
quences were blasted against the SFB genome and aligned to annotated open read- 
ing frames. 

Epitope mapping. We expressed overlapping fragments spanning the active ORF 
using the pGEX-4T1 bacterial expression system, and colonies were used to stimu- 
late the relevant hybridoma. This process was repeated until we identified minimal 
fragments conferring antigenicity. The mapping was further verified by stimulat- 
ing hybridomas with synthetic peptides (Genescript). 

RNA-seq analysis of the SFB transcriptome. Wild-type B6 mice from Jackson 
Laboratory or Taconic Farm, confirmed for the presence or absence of SFB by quan- 
titative PCR”, were used for microbiome transcriptome analysis. Within five min- 
utes after sacrifice, the terminal ileum of each mouse was resected and luminal 
contents were squeezed with sterile forceps into a mortar cooled with liquid nitro- 
gen. 1 ml nuclease-free TE was washed through the ileum into the mortar. The total 
luminal contents and washing were then ground toa fine powder with a pestle cooled 
with liquid nitrogen and kept on dry ice. The powder was then transferred to 15 ml 
TRIzol (Life Technologies) in a 50 ml falcon tube and vortexed. The manufac- 
turer’s protocol was then used to obtain RNA. The resulting RNA was extracted 
twice with acid phenol-chloroform, precipitated, treated with Ambion Turbo DNA- 
free, and cleaned-up with an RNeasy column to yield RNA with an undetectable 
concentration of DNA by Qubit. A portion of this RNA was treated once with 
Epicentre’s Ribo-Zero rRNA removal kit, using equal volumes of specific oligo- 
nucleotides from the Meta-bacteria and Human/mouse/rat kits. An Illumina 
RNA-seq library was prepared from these samples using a previously-described 
strand-specific Nextera protocol. The resulting reads were aligned to the SFBNYU 
genome with Bowtie*! and transcript abundance was estimated using Cufflinks** 
with default parameters. 

Production of anti-SFB antibody and immunostaining. The cDNA fragments 
corresponding to amino acids 43-359 (3340N) and 734-1060 (3340C) of SFBNYU_ 
003340 were cloned into the pGEX6p1 expression vector. Recombinant proteins 
fused to N-terminal glutathione-S-transferase (GST) were expressed in E. coli BL21, 
purified with glutathione Sepharose 4B (GE), and were released with PreScission 
protease (GE). The flow-through fractions containing polypeptides without the 
GST tag were collected as immunogen. Rabbit polyclonal antibodies against both 
polypeptides were raised by Covance. For immunostaining, bacteria were fixed 
with 2% paraformaldehyde, followed by washing with 0.5% TritonX-100. Bacteria 
were incubated sequentially with primary antibody (1:1 mix of the two rabbit- 
anti-3340 antibodies) and phycoerythrin-conjugated goat anti-rabbit antibody. 
Activation of polyclonal SILP T;17 cells. GEP* and GFP” SILP CD4* T cells 
sorted from 1123r°!"’* mice were incubated with 2 X 10° APCs (CD11c™ cells 
purified from the spleen) and indicated stimuli in complete RPMI medium sup- 
plemented with IL-2 (10 Uml ') and IL-7 (5 ng ml ') for 2-3 days. Cells were 
collected and stained with VB-specific antibodies. Forward scatter increment, as 
readout for cell activation, was analysed by FACS. 

IL-17A ELISPOT assay. IL-17A ELISPOT was performed with a mouse/rat IL- 
17A ELISPOT Ready-SET-Go! kit (eBioscience). Dots were automatically enum- 
erated with ImmunoSpot software (Version 5.0). 

MHCII tetramer production and staining. I-A°/3340-A6 tetramer was produced 
as previously described”’. Briefly, QFSGAVPNKTD, an immunodominant epitope 
from SFBNYU_0033400, covalently linked to 1-A? viaa flexible linker, was produced 
in Drosophila S2 cells. Soluble pMHCII monomers were purified, biotinylated, 
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and tetramerized with phycoerythrin- or allophycocyanin-labelled streptavidin. 
To stain endogenous cells, SILP cells were first resuspended in FACS buffer with 
FcR block, 2% mouse serum and 2% rat serum. Then tetramer was added (10 nM) 
and incubated at room temperature for 60 min. Cells were washed and followed 
by regular staining at 4°C. 1-A°/2W and I-A°/LLO tetramers were previously 
described***’. 

Generation of Ty17-TCRTg mice. TCR sequences of 7B8, 1A2 and 5A11 were 
cloned into the pT« and pT vectors kindly provided by D. Mathis’. TCR trans- 
genic animals were generated by the Rodent Genetic Engineering Core at the New 
York University School of Medicine. Positive pups were genotyped by PCR and 
kept on SFB-minus flora. 

Adoptive transfer. Spleens from 7B8Tg mice were collected and disassociated. 
Red blood cells were lysed using ACK lysis buffer (Lonza). Naive Tg T cells (CD62L* 
CD44" Vp14* CD4* CD3*) were sorted on a BD Aria II. Cells were transferred 
into congenic Ly5.1 recipient mice by retro-orbital injection. In some experiments, 
we used Ly5.1/Ly5.2 TCRTg mice as donor and transferred naive Tg T cells to 
congenic Ly5.2 recipient mice. 

Heterologous expression of SFBNYU_003340 in Listeria monocytogenes. To 
generate strains of L. monocytogenes that express the SFBNYU_003340 antigen, the 
entire coding region including its predicted signal sequence was PCR-amplified from 
a plasmid containing the SFBNYU_003340 gene. The resultant PCR product was 
digested and sub-cloned into the Listeria expression vector pIMK2 (provided by 
C. Hill), allowing the gene to be expressed under the synthetic promoter Pheip 
(High expression promoter in L. monocytogenes)”’. The resultant plasmid desig- 
nated pIMK2-3340 was transformed into electrocompetent Listeria monocytogenes 
strain 10403S-inlA” (provided by N. E. Freitag) and plated on selective medium 
containing kanamycin (50 pg ml ')?*. pIMK2 is a derivative of the plasmid pPL2 
and stably integrates in single copy within the tRNA“" gene following electro- 
poration™. The integrity of the SFBNYU_003340 gene was validated by PCR and 
expression confirmed by Coomassie staining of L. monocytogenes exoproteins. 


Oral infection with SFB and L. monocytogenes. For SFB colonization, we dis- 
solved in sterile PBS fresh faecal pellets collected from 1123r°""’°"" Rag2-’~ mice 
that have highly elevated levels of SFB, and infected mice by oral gavage. For 
L. monocytogenes colonization, we grew Listeria-3340 and Listeria-empty in brain 
heart infusion medium and infected mice orally with 10° colony forming units. 
Bioinformatic analysis. Protein predictions were made by bioinformatic tools, 
including Psort (Version 3.0)” and Cello (Version 2.5)”° for localization predic- 
tion, and IEDB (Immune Epitope Database) for MHCII binding affinity prediction. 
Statistical analysis. All analyses were performed using GraphPad Prism (Version 
6.0). Differences were considered to be significant at P values <0.05. 
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Extended Data Figure 1 | Stimulation of SILP T}17 cells requires intestinal mice in the presence of APCs sufficient (WT) or deficient (KO) for MHC 
microbiota antigen presentation. a, Intestinal GFP* CD4* T cells from class II. c, Evaluation of potential activation of bystander CD4* T cells upon 


1123r°"’* mice stimulated with faecal material from Jackson and Taconic mice _ stimulation with SFB antigen. SILP CD4° T cells from mice with Jackson flora 
in the presence of syngeneic splenic APCs. Forward scatter was evaluated after (Ly5.1) and Taconic flora (Ly5.2) were co-cultured or stimulated separately 
2 days. b, Ty;17 cell activation by faecal material from SFB-monoassociated with APCs and SFB-monoassociated faecal material, and FSC was evaluated. 
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Extended Data Figure 2 | Microbiota-dependent TCR usage bias among 
SILP Ty;17 cells. a, SILP CD4* T cells from 1123r°"’* mice were analysed for 
utilization of VBs in Ty17 cells versus non-Ty17 cells. Ratios of the percentage 
of each TCR VB in GFP* vs GFP” cells are shown. Each symbol represents 
one mouse. b, Relative expression of VB14 and VB6 TCRs by SILP T};17 versus 
non-Ty17 CD4* T cells from 1123r¢!""* mice. Left, representative FACS plots; 
Right, analysis of multiple animals. c, Specific enrichment of VB14 TCRs in 
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CD4* T cells expressing RORyt and IL-17A, but not FOXP3 or IFNy. Left, 
representative FACS plots. Right, analysis of multiple animals. Each symbol 
represents one mouse. d, Correlation of VB14 enrichment in T};17 cells 
with the presence of specific commensal microbiota. B6 Jackson mice were 
housed alone or cohoused with B6 Taconic mice for two weeks. Left, 
representative FACS analyses. Right, analysis of multiple animals. 
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Extended Data Figure 3 | T,;17 TCR repertoire analysis by pyrosequencing. 
a, Numbers of unique VB14 CDR3 sequences of individual SILP T}17 and 
non-T}y17 samples. The sequences were normalized for numbers of cells and 


total reads. b, Preferential expansion of VB14* clones in the Tyy17 


compartment in the SILP. The proportions of the 10 most abundant VB14 
CDR3 sequences from Ty17 and non-Ty17 cells from 8 mice are shown. 
c, Ty17-non Ty17 bias of unique VB14 CDR3 sequences in the SILP of 
multiple mice. 
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Extended Data Figure 4 | Single-cell TCR cloning and TCR hybridoma 
screen. a, Efficiency of single-cell VB14 cloning from SILP T};17 and 
non-T}y17 cells of multiple mice. b, Distributions of unique VB14 sequences in 
Ty17 and non-Ty17 cells within the SILP. Each plot represents one mouse 
shown ina. y and x axes represent numbers of T}717 cells and non-Ty17 cells for 


each unique VB14 sequence. Numbers of unique sequences are shown in 
coloured circles. c, Responses of Tj17 and non-T}y17 TCR hybridomas to 
small intestinal luminal contents from B6 Taconic and B6 Jackson mice. 

d, Stimulation of T};17 TCR hybridomas by SFB-monoassociated antigens in 
the presence of APCs sufficient (WT) or deficient (KO) for MHC class II. 
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Extended Data Figure 5 | Identification of SFBNYU_003340 epitopes 
recognized by a subset of the Ty17 TCR hybridomas. a, Schematic 
representation of the antigen screen using a whole-genome shotgun SFB 
library. b, Stimulation of the 7B8 hybridoma by bacterial pool 3F12. 

c, Reactivity of 7B8 and four other TCR hybridomas with bacterial clone 


o10? 10° 10% 10° ow 1 a WP 


3F12-E8. d, Diversity of the CDR3 sequences of TCRs specific for 3F12-E8. 
Note that they belong to different Va subsets and have distinct VB14 CDR3 
sequences. e, Responses of the 3F12-E8-specific TCR hybridomas to core 
epitopes encoded by minigenes expressed in E. coli. 
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Extended Data Figure 6 | Identification of SEBNYU_004990 epitopes 
recognized by related TCRs. a, Top, the distribution in T}17 and non-Ty17 
cells of four TCRs that share an identical TCRa chain. Bottom, amino acid 
alignment of the VB14 CDR3 sequences. The green box highlights the sequence 
differences. b, Stimulation of the 5A11 hybridoma by bacterial pool 2D10 in 


the SFB antigen screen. c, Responses of 4 TCR hybridomas, including a 
non-T}17 hybridoma, to bacterial clone 2D10-A10. d, Responses of the 
2D10-A10-specific TCR hybridomas to core epitopes encoded by minigenes 
expressed in E. coli. e, TCR hybridoma responses to titrated synthetic peptide 
(IRWFGSSVQKYV) in the presence of APCs. 
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Extended Data Figure 7 | SFB epitopes recognized by diverse Tj;17 cell filtering approach to select candidate SFB epitopes. d, Summary of newly 


TCRs. a, The epitopes recognized by the VB14* TCR hybridomas stimulate _ selected and the known A6 and A15 SEB peptides. e, IL-17A ELISPOT screen 
only VB14* Ty17 cells from the SILP. Ty17 cells sorted from 1123r°""”* mice __ for indicated peptides using SILP Ty17 cells sorted from SFB-colonized 

were stimulated with indicated peptides (listed in d) in the presence of 1123r°F"’* mice. The A6 peptide from SFBNYU_003340 and anti-CD3 served 
APCs. Left, representative IL-17A ELISPOT assay with triplicates. Right, as positive controls. f, VB14 usage in Ty17 cells specific for peptide N5. 
normalized peptide-specific T}17 responses. Each dot represents one mouse. _Left, representative IL-17A ELISPOT assay with triplicates for peptide N5, 

b, Polyclonal responses of VB14* and VB14 SILP Ty17 cells to SFB antigens. using VB14~ and VB14 SILP T,17 cells sorted from 1123r°""’* mice. 
Representative FACS plots from five experiments are shown. ¢, Bioinformatics _ Right, normalized N5-specific T}17 responses. Each dot represents one mouse. 
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Extended Data Figure 8 | SFB-specific T cells become T};17 cells in 
SFB-colonized mice. a, SFB-dependent 7B8Tg T cell accumulation in the 
SILP. 2 X 10* naive 7B8T¢ T cells were transferred into congenic Ly5.1 
recipient mice that were SFB-colonized or SFB-free. CD4~ T cells in the 
SILP were examined for donor and recipient isotype markers after 13 days. 
b, Top, strategy for co-transfer of congenic 1A2Tg and 5A11Tg T cells into 
SFB-colonized recipient mice. Bottom, FACS analysis of RORyt expression in 
host- and donor-derived CD4* T cells in the SILP at 7 days after transfer. 

c, FACS analysis of transcription factors in host- and donor-derived SILP 
CD4* T cells after transfer of naive 7B8Tg T cells as in a. d, FACS analysis of 
SILP T cells from 1123r°""”* mice, stained with 1-A?/3340-A6 tetramer and 


LETTER 


control tetramer (2W). e, FACS analysis of SILP T cells of B6 mice from 
colonies with different microbiota, stained with 1-A°/3340-A6 tetramer 

and intracellular RORyt antibody. f, Expansion of 7B8Tg T cells in mice 
colonized with Listeria monocytogenes expressing SFBNYU_003340. 

Top, immunofluorescence microscopic visualization of the expression of SFB 
protein by L. monocytogenes. Listeria-3340 and Listeria-empty were stained 
with anti-3340 rabbit polyclonal antibody. Red, anti-3340 antibody staining. 
Blue, DAPI staining. Bottom, naive Ly5.1~ 7B8T¢ cells were transferred into 
congenic mice infected with Listeria-3340 or Listeria-empty. Seven days after 
transfer, donor-derived CD4* T cells in the SILP were analysed by FACS. 
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Extended Data Figure 9 | Transcription factor expression in SFB-specific 
and Listeria-specific T cells in co-infected mice. Representative of data 
plotted in Fig. 4b. a, Experimental design for tracking both SFB- and 
Listeria- specific CD4* T cells following intestinal colonization with both 
bacteria. Ly5.2 B6 mice were colonized with Listeria monocytogenes, SFB, or 


both bacteria, and 7B8T¢ T cells from Ly5.1 mice were injected intravenously. 
Expression of T};1 and Ty17 transcription factors in the SFB-specific 7B8Tg 
cells and LLO tetramer-specific recipient T cells was evaluated. b, Intracellular 
stain for RORyt. c, Intracellular stain for T-bet. 
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Extended Data Figure 10 | SFB-specific T17 cells are present in both SILP —_ animals. Left, per cent of tetramer-positive cells among total CD4* T cells in 
and large intestine lamina propria (LILP) of SFB-colonized mice. T cells each region of the intestine. Right, per cent of RORyt™ cells among the 

were stained with I-A°/3340-A6 tetramer and antibody to intracellular RORyt. _ tetramer-positive cells. Each symbol represents cells from a separate animal. 
a, Representative FACS plots (gated on CD4* T cells). b, Analysis of multiple 
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Nociceptive sensory neurons drive interleukin-23- 
mediated psoriasiform skin inflammation 


Lorena Riol-Blanco™, Jose Ordovas-Montanes'*, Mario Perro', Elena Naval!, Aude Thiriot', David Alvarez', Silke Paust'+, 


John N. Wood? & Ulrich H. von Andrian! 


The skin has a dual function as a barrier and a sensory interface 
between the body and the environment. To protect against invading 
pathogens, the skin harbours specialized immune cells, including 
dermal dendritic cells (DDCs) and interleukin (IL)-17-producing 
y6 T (y65T17) cells, the aberrant activation of which by IL-23 can 
provoke psoriasis-like inflammation’ ~*. The skin is also innervated 
by a meshwork of peripheral nerves consisting of relatively sparse 
autonomic and abundant sensory fibres. Interactions between the 
autonomic nervous system and immune cells in lymphoid organs 
are known to contribute to systemic immunity, but how peripheral 
nerves regulate cutaneous immune responses remains unclear™*. 
We exposed the skin of mice to imiquimod, which induces IL-23- 
dependent psoriasis-like inflammation’*. Here we show that a subset 
of sensory neurons expressing the ion channels TRPV1 and Na,1.8 
is essential to drive this inflammatory response. Imaging of intact 
skin revealed that a large fraction of DDCs, the principal source of 
IL-23, is in close contact with these nociceptors. Upon selective phar- 
macological or genetic ablation of nociceptors’"'', DDCs failed to 
produce IL-23 in imiquimod-exposed skin. Consequently, the local 
production of IL-23-dependent inflammatory cytokines by dermal 
y5T17 cells and the subsequent recruitment of inflammatory cells 
to the skin were markedly reduced. Intradermal injection of IL-23 
bypassed the requirement for nociceptor communication with DDCs 
and restored the inflammatory response’. These findings indicate 
that TRPV1*Na,1.8* nociceptors, by interacting with DDCs, regu- 
late the IL-23/IL-17 pathway and control cutaneous immune responses. 

Repeated topical application of imiquimod (IMQ) to murine skin 
provokes inflammatory lesions that resemble human psoriasis”*. This 
response is mediated by IL-23, which stimulates skin-resident 76 T cells 
to secrete IL-17 and IL-22, cytokines that induce inflammatory leuko- 
cyte recruitment and acanthosis’*. Indeed, antibodies targeting the shared 
p40 subunit of IL-12 and IL-23 inhibit both IMQ-induced murine der- 
matitis and human psoriasis*’*. Frequent symptoms in human psori- 
asis, aside from the prominent skin lesions, include the sensations of 
itch, pain and discomfort in affected areas’. Clinical reports suggest 
that intralesionally administered anaesthetics or surgical denervation 
of psoriatic lesions not only abrogate local sensation, but also amelio- 
rate local inflammation’. Similarly, in mutant mice with disseminated 
psoriasiform dermatitis, peripheral nerve dissection attenuated skin 
inflammation’*®; however, cutaneous nerves are composed of sympa- 
thetic and several types of sensory fibres, and the role of individual types 
of nerve fibres remains unclear®. Using the IMQ model, we investigated 
whether and how specific subsets of peripheral nerves contribute to the 
formation of psoriasiform skin lesions. 

Skin sensations perceived as inflammatory pain, noxious heat and some 
forms of itch are transmitted by sensory fibres that express the cation 
channel TRPV1. Most TRPV1 " fibres co-express the sodium channel 
Nay1.8 (refs 9-11). Nay1.8* nociceptors can be identified in the dermis 
of Na,1.8-TdTomato (TdT) mice by their red fluorescence? (Fig. 1a). 


Confocal microscopy of skin samples from Na,1.8-TdT mice co-stained 
for tyrosine hydroxylase (TH), which identifies sympathetic fibres, and 
for B3-tubulin, a pan-neuronal marker, revealed that Nay1.8 *TH™ noci- 
ceptors represent the vast majority of cutaneous nerve fibres, whereas 
Nay1.8 TH™ sympathetic fibres are rare. 

To investigate the roles of sympathetic fibres and nociceptors in the 
IMQ model, mice were treated systemically with either 6-hydroxydopamine 
(6OHDA) or resiniferatoxin (RTX) to ablateTH* sympathetic neurons 
or TRPV1* nociceptors, respectively'!”” (Extended Data Figs 1, 2). Sub- 
sequently, IMQ was applied topically to one ear and the ensuing inflam- 
matory response was assessed based on the change in ear thickness, size 
of the myeloid infiltrate (Extended Data Fig. 3a) and tissue contents of 
inflammatory cytokines. 

After sympathetic denervation, IMQ-induced ear swelling was reduced 
compared to controls (Extended Data Fig. 1c); however, the inflam- 
matory infiltrate was increased, whereas IL-17A, IL-17F, IL-22 and IL- 
12/23p40 production remained unchanged (Extended Data Fig. 1d-i). 
Thus, sympathetic innervation exerts little or no direct local control over 
the inflammatory skin response. The observed changes were probably 
due to cardiovascular effects and/or global immune dysregulation after 
systemic sympathectomy° (Supplementary Information). By contrast, 
in RTX-treated mice both ear swelling and inflammatory infiltrates in 
IMQ-exposed ears were profoundly reduced (Fig. 1b-e and Extended 
Data Fig. 4a—d). RTX treatment did not alter the systemic supply of inflam- 
matory cells'* (Extended Data Fig. 4e, f). Moreover, intravital microscopy 
of ear skin revealed similar leukocyte rolling in RTX-treated and con- 
trol mice (Extended Data Fig. 5), indicating that the absence of noci- 
ceptors did not affect the baseline adhesiveness of dermal microvessels”. 
It is more likely that ablation of TRPV1” sensory nerves reduced IMQ- 
induced inflammation through local, extravascular mechanisms. How- 
ever, the attenuated inflammatory response was not limited to the skin, 
as the IMQ-induced enhancement in cellularity of the draining auri- 
cular lymph node was also blunted by RTX (Fig. 1f). 

Lymph nodes have a critical function in dermal antigen presentation 
to naive T cells and in the generation of migratory effector cells. They 
also possess peripheral innervation”, which could have been altered by 
RTX; however, the role of skin-draining lymph nodes during psoriatic 
inflammation is unclear. To address this issue, we tested the effect of 
IMQ in lymphotoxin-«-deficient (LT« /~) mice, which are devoid of 
lymph nodes. Compared with wild-type mice, there was no statistical 
difference in ear thickness, frequency or composition of the inflamma- 
tory infiltrate in ears of LTx ‘~ mice (Fig. 1g, h and data not shown). 
Additionally, we treated wild-type mice with FTY720, which blocks T-cell 
egress from lymph nodes, preventing trafficking of effector cells to peri- 
pheral tissues”. Again, IMQ elicited full-fledged inflammation in exposed 
ears (Fig. 1i, j and data not shown), indicating that T-cell priming in skin 
draining lymph nodes is dispensable for the acute induction of psoriasi- 
form inflammation. Together, these findings imply that TRPV1™ noci- 
ceptors promote local immune responses directly in the skin. 
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Neutrophil recruitment to the skin and keratinocyte hyperprolifera- 
tion, hallmarks of IMQ-induced inflammation, are driven by IL-17 and 
IL-22, respectively””’*. We asked whether nociceptors regulate the pro- 
duction of these cytokines. Indeed, after IMQ treatment, protein levels 
of IL-17A, IL-17F and IL-22 had markedly increased in the ears of 
control mice, whereas in RTX-treated mice IL-17A was very low and 
IL-17F and IL-22 remained below the detection limit (Fig. 2a—c). Thus, 
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Figure 1| TRPV1* nociceptor ablation 
attenuates skin inflammation and draining 
lymph node hypertrophy in the IMQ model. 

a, Representative whole-mount confocal 
micrograph of normal ear skin from Na,1.8-TdT 
reporter mice (Na,1.8* nociceptors, red) stained 
for TH (sympathetic nerves, green) and §3-tubulin 
(peripheral nerves, blue). b-f, The ear skin of 
vehicle (dimethylsulphoxide (DMSO))-treated 
controls or TRPV1~ nociceptor ablated (RTX) 
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mice was treated with topical IMQ cream daily. 


b, Ear thickness was measured relative to 

the contralateral ear at indicated time points 

(n = 10-15 mice per time point; *P < 0.02). 

c, Representative histological sections of 
IMQ-treated ears at day 6 stained by haematoxylin 
and eosin. d, e, Total inflammatory monocytes 

(n = 10) (d) and total neutrophils (e) in skin at 
day 3 (n = 10; **P < 0.005). f, Total cell number 
in auricular lymph nodes (aLNs) at day 3 (n = 20; 
*P = 0.01; ***P < 0.001). g-j, IMQ was applied 
daily to wild-type (WT; n = 10) and LTa ‘~ mice 
(n = 6) (g, h) or vehicle-treated (n = 10) and 
FTY720-treated mice (n = 10) (i, j), and ear 
swelling was measured at the indicated time points 
(g, i). h, j, The percentage of CD45* leukocytes was 
determined in ear skin digests on day 6. NS, not 
significant. Error bars show mean ~ standard error 
of the mean (s.e.m.). 
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TRPV1* nociceptors control the generation of several key effector 
cytokines in psoriasiform dermatitis. 

As our findings in FTY720-treated mice suggested that recruitment 
of lymph-node-derived effector T cells is not needed for IMQ-induced 
skin inflammation, we asked whether nociceptors regulate cytokine pro- 
duction by skin-resident lymphocytes, particularly y6T17 cells, which 
produce IL-17F and IL-22 (refs 3, 22, 23). In the ears of IMQ-treated 
control mice, most IL17F* lymphocytes were dermal 76 T cells, whereas 
few conventional «B T cells expressed IL-17F (Fig. 2d and Extended 
Data Figs 3b, 6a). In RT'X-treated mice, both IL-17F™ and IL-22 * dermal 
75 T cells were significantly reduced, indicating that TRPV1* neurons 
drive the local production of IL-17F and IL-22 primarily by y5T17 cells 
(Fig. 2e-g and Extended Data Fig. 6b, c). These T cells seed the skin in 
early life and are poised for rapid IL-17 production upon stimulation 
by IL-23 (ref. 24). Indeed, in the ears of IL-23R oF P’* mice”, ~60% of der- 
mal y6T17 and ~20% of xf T cells expressed IL-23R (Fig. 2h, i), whereas 
T cells in auricular lymph nodes and epidermal y6 T cells expressed 
little or no IL-23R (Extended Data Fig. 6d, e). 

In light of the preferential expression of IL-23R on dermal y6T17 
cells, the known role of IL-23 as a driver of IL-17 and IL-22 generation 


Figure 2 | TRPV1* nociceptors control IL-17F and IL-22 production by 
IL-23R* dermal 5 T cells. a—c, After 3 days of IMQ challenge, ears from 
vehicle (DMSO)-treated mice (n = 5) or RTX-treated mice (n = 5) were 
harvested to perform enzyme-linked immunosorbent assays (ELISA) for 
IL-17A (a), IL-17F (b) and IL-22 (c). **P < 0.01; ***P < 0.001. ND, not 
detected. d—g, Flow cytometry on digested ear skin was performed on day 6 
of IMQ challenge. d, Relative frequency of IL-17F* or IL-22* dermal 78 T cells 
and «8 T cells in IMQ-treated control mice (n = 15). e, Representative FACS 
plots of IL-17F staining in dermal y6 T cells from vehicle (DMSO)- and 
RTX-treated mice. f, g, Quantification of frequency of IL-17F* (f) and IL-22* 
(g) among dermal yé T cells in DMSO- and RTX-treated mice (n = 5 per 
group; *P < 0.05; **P = 0.01). h, i, Representative FACS plots of normal ear 
skin from an IL-23R@"?/* mouse (h) and quantification of frequency of 
IL-23R-GEP™ cells among skin-resident Thyl* T-cell subsets (n = 8) (i). 
Epi., epidermal. Error bars show mean = s.e.m. 
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in psoriasiform inflammation’ and our finding that RTX pre-treatment 
profoundly diminished IMQ-induced cytokine production by y8T17 
cells, we proposed that TRPV1* nociceptors might control dermal IL-23 
production. Topical IMQ treatment of control mice markedly increased 
p40 protein levels, as well as messenger RNA levels of I112b (IL-12/23p40) 
and 1123a (IL-23p19), but not [12a (IL-12p35). These effects were nearly 
abolished after RTX treatment (Fig. 3a and Extended Data Fig. 7a-c), 
suggesting that TRPV1* nociceptors are essential for cutaneous IL-23 
production. It seems unlikely that IL-12 had a major impact in the IMQ 
model, because IMQ-induced inflammatory parameters were consid- 
erably reduced in IL- 23RCEP/GEP mice, which respond to IL-12 but not 
IL-23 (Fig. 3b-f). However, IL-12-dependent skin inflammation induced 
using a chemical irritant, 2,4-dinitrofluorobenzene (DNFB)”®, was pro- 
foundly reduced in RTX-treated mice, suggesting that RTX-sensitive fibres 
also have a role in IL-12-driven dermatitis (Extended Data Fig. 7d, e). 

Of note, although IL-17F and IL-22 production as well as myeloid 
infiltrates were virtually abolished in IL-23R7'"’°" mice, ear swelling 
was only partially reduced (Fig. 3b-f), suggesting that topical IMQ pro- 
motes modest tissue oedema through an IL-23-independent pathway. 
This activity was independent of nociceptors, as RTX treatment of 
IL-23RGFP/GFP mice had no effect on the IMQ-induced swelling (Ex- 
tended Data Fig. 7f). 

Although the above findings clearly demonstrate that nociceptors 
are indispensable for IMQ-induced dermal IL-23 production, it remained 
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Figure 3 | DDC-derived IL-23 is critical to drive psoriasiform skin 
inflammation and acts downstream of RTX-sensitive nociceptors. a, After 
3 days of IMQ challenge of vehicle (DMSO)- or RTX-treated mice, ears were 
harvested and total protein was prepared to quantify IL-23p40 by ELISA (n = 5 
per experiment; ***P < 0.001). b, Ears of wild-type (WT) or IL-23R0PP/GFP 
mice (n = 5 per group) were treated daily with IMQ and ear thickness was 
measured relative to the contralateral ear at the indicated time points 

(***P < 0.001). c-f, After 3 days of IMQ challenge in wild-type (n = 5) or 
IL-23R@""’G™? mice (n = 4) total protein was prepared from ear skin and 
IL-17F (c) and IL-22 (d) were quantified by ELISA (*P < 0.05; ***P < 0.001). 
Cell suspensions from exposed ears stained for inflammatory monocytes (e) 
and neutrophils (f) to assess total numbers (n = 5 per experiment; *P < 0.05; 
**P < 0.01). g-i, IL-23 was injected intradermally every other day into the ears 
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possible that nociceptive fibres exert additional pro-inflammatory func- 
tions, for example, by directly regulating yoT17 cells. To address this, we 
performed intradermal IL-23 injections, which trigger psoriasiform skin 
inflammation in murine skin’’. Regardless of whether nociceptors were 
ablated or left intact, IL-23 administration resulted in profound ear 
swelling and fully rescued IL-17A and IL-17F production by Thy1* 
cells (Fig. 3h, i) and yST17 cells (Extended Data Fig. 7g, h). Together, 
these results suggest that the pro-inflammatory function of TRPV1~* 
neurons is rooted exclusively in the promotion of IL-23 production, at 
least in this experimental setting. 

Next, we sought to identify the IL-23-producing cell type(s) regu. 
lated by TRPV1* nociceptors. In intestinal barrier tissues, Ly-6C"®" 
inflammatory monocytes were identified as a major source of IL-23 in 
a colitis model’’. However, depletion of neutrophils and inflammatory 
monocytes with anti-Gr-1 did not affect IMQ-induced dermal IL-23p40 
or IL-17F levels (Fig. 3j, k and Extended Data Fig. 8a), suggesting that 
skin-resident dendritic cells and/or macrophages rather than migratory 
myeloid cells were the critical source of IL-23. Thus, we injected CD11c- 
diphtheria toxin receptor (DTR) mice with diphtheria toxin (DTX), which 
depleted DDCs and Langerhans cells, but not macrophages (Extended 
Data Fig. 8b, c). After DT X treatment, IMQ-induced expression of I/23a 
mRNA was markedly reduced in treated ears (Fig. 31). Although these 
results do not discriminate between the relative contributions of Lang- 
erhans cells and DDCs to IL-23 production, IL-34-deficient mice— which 
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of DMSO- or RTX-treated mice (n = 8 per group). g, Ear thickness was 
measured as indicated. h, i, After 3 days, the number of IL-17A-producing 
(h) and IL-17F-producing (i) Thy1* cells per ear was quantified by FACS 

(n = 5). j, k, Mice were treated with anti-Gr-1 to deplete neutrophils and 
inflammatory monocytes or isotype-matched control monoclonal antibody 
and challenged with IMQ for 3 days. Ear skin protein lysates were analysed for 
IL-23p40 (j) and IL-17F (k) by ELISA (n = 5).1, CD11c-DTR mice were treated 
with DTX or PBS 12h before IMQ challenge. After 6h, ears were harvested 
and processed for total RNA isolation and [/23a mRNA levels were analysed 
by quantitative polymerase chain reaction (qPCR) (n = 4; ***P = 0.001). 
AU, arbitrary units. ND, not detected; NS, not significant. Error bars show 
mean + s.e.m. 
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selectively lack Langerhans cells—are not impaired in IMQ-induced 
skin inflammation®. We focused our analysis on DDCs, which can be 
further subdivided into CD103* and CD11b* subsets”*. Both subsets, 
as well as macrophages, were sorted by fluorescence-activated cell sort- 
ing (FACS) from IMQ-treated and control ears to measure mRNA for 
1123a and Il12a. Although CD103* DDCs showed the most notable 
upregulation of [123a mRNA, taking into consideration that CD11b* 
DDCs are more abundant, we estimate that the latter subset produced 
~75% of the 1123a mRNA, consistent with a recent report” (Fig. 4a and 
Extended Data Fig. 8c, d). 

Having identified DDCs as the principal source of IMQ-induced 
IL-23, we sought to characterize the spatial relationship between DDCs 
and cutaneous nerves. Remarkably, confocal microscopy of skin whole 
mounts revealed that at steady state ~75% of DDCs were either in direct 
contact or in close proximity to sensory nerves (Fig. 4b-d and Extended 
Data Fig. 9a). Interactions were apparent along the entire length of nerves, 
suggesting that DDCs may receive signals from unmyelinated nocicep- 
tor axons and not just from nerve terminals. However, given the high 
density of peripheral nerves in the skin, it was difficult to judge whether 
the association with DDCs occurred merely by chance or reflected a 
biased distribution. To address this possibility, we compared DDC local- 
ization relative to two other dense anatomical structures: blood and lymph 
vessels. In resting tissues, contacts of DDCs with these microvascular net- 
works were only about half as frequent as with peripheral nerves (Fig. 4d). 

Of note, even though inflammation enhances DDC motility and egress 
into draining lymphatics, IMQ challenge did not alter the frequency of 
DDCs contacting nerves (Extended Data Fig. 9b), suggesting that DDCs 
engaged in dynamic interactions with nociceptors. We reconstituted 
Na,1.8-TdT mice with yellow fluorescent protein (YFP)-tagged CD11c 
bone marrow and performed time-lapse multiphoton-intravital micros- 
copy (MP-IVM) in ear skin to generate three-dimensional time-lapse 
videos of interactions between YFP* DDCs and TdT™ nociceptors. These 
interactions were highly diverse (Supplementary Video 1); some DDCs 
seemed to be anchored on nerves and sometimes extended protrusions 
to probe the surrounding tissue, whereas others appeared to use nerve 
fibres as a scaffold for directional migration (Supplementary Video 2). 

Together, our results support the idea that DDCs can physically inter- 
act with a subset of nociceptors that regulate the production of IL-23. 
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However, it should be cautioned that the pharmacological target of RTX, 
TRPV1, is also expressed on some non-neuronal cells'®. Consequently, 
we sought an alternative to confirm the role of nociceptors in an exper- 
imental system that does not rely on pharmacologically targeting TRPV1. 
To this end, we used Na,1.8-diphtheria toxin (DTA) mice in which 
Na,1.8* fibres, which respond to mechanical pressure, inflammatory 
pain and noxious cold, are selectively deleted’. Although nociceptor- 
associated transcripts in dorsal root ganglia were more profoundly reduced 
in this genetic model than after RTX treatment, Trpvl mRNA levels 
were only reduced by ~80%, consistent with the fact that noxious heat 
sensing fibres express TRPV1, but not Na,1.8 (Extended Data Figs 2c, 9c)’. 
After IMQ challenge, the ears of Na,1.8-DTA mice contained very low 
protein levels of IL-17A, IL-17F, IL-22 and IL-23p40 as compared to 
littermate controls (Fig. 4e-h). We conclude that Na,1.8° TRPV1 * neu- 
rons are insufficient to induce IL-23 production; rather, NaV1 8*TRPVI1* 
nociceptors are driving the response. 

In light of these results, we propose a model of cutaneous neuroimmune 
interactions (Extended Data Fig. 10) whereby dermal Na,1.8*TRPV1* 
nociceptors are essential to induce IL-23 production by nearby DDCs. 
IL-23 then acts on IL-23R* y8T17 cells to induce IL-17F and IL-22 
secretion, which precipitates the recruitment of circulating neutrophils 
and monocytes driving psoriasiform skin inflammation. The fact that 
both RTX and Na,1.8-DTA mice largely preserve a dense meshwork of 
dermal nerves (Extended Data Fig. 9d, e) implies that DDCs do not 
simply rely on nerves as a scaffold from which to produce IL-23. It is 
more likely that Na,1.8*TRPV1* nociceptors actively induce and con- 
trol IL-23 production. 

Although further studies will be needed to determine the precise 
molecular underpinnings of neuroimmune communication in the skin 
(Supplementary Information), the present findings indicating nociceptor- 
mediated control of DDCs and the IL-23/IL-17 axis open new avenues 
for the treatment of inflammatory diseases in the skin and perhaps else- 
where. Intriguingly, recent work has shown that Nayl .8* nerve fibres exert 
immunosuppressive activity during Staphylococcus aureus infection”, 
raising the possibility that some pathogens have evolved mechanisms 
to subvert the pro-inflammatory function of nociceptors. Together, this 
recent work and the present study suggest an emerging new paradigm 
whereby TRPV1*Na,1.8* nociceptive fibres integrate environmental 
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Figure 4 | DDCsare closely associated with cutaneous nerves and depend on 
Na,1.8* nociceptors for IMQ-induced IL-23 production. a, Mice were 
challenged with IMQ (n = 20 pooled mice per condition) and, after 6h, 
myeloid cell populations comprising dermal macrophages (mac.) and two 
subsets of DDCs were FACS sorted (Extended Data Fig. 8c) from cell 
suspensions to measure [/23a mRNA by qPCR. b, Representative confocal 
micrographs of ear skin whole mounts from CD11c-YFP mice stained for 
63-tubulin (peripheral nerves, red) and Lyve-1 (lymphatics, blue) or CD31 
(blood and lymphatic endothelial cells, blue). Original magnification, 200. 
c, Close-up confocal micrograph of a CD11c-YFP cell in contact with 
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a nerve (see also Supplementary Videos 1, 2). d, Quantification of 
three-dimensional DDC proximity to peripheral nerves, lymphatics and blood 
vessels in normal ear skin. The frequency of DDCs (n = 330) in contact, 
proximal (0-7 jim) and distal (>7 um) to nerve fibres was determined as 
described in Methods and a Chi-squared test showed bias of DDCs to 

nerves relative to lymphatics and blood vessels (***P < 0.0001). e-h, Ears of 
Na,1.8-DTA or control littermates were treated daily with IMQ. Total protein 
was prepared from ear skin after 3 days and IL-17A (e), IL-17F (f), IL-22 (g) 
and IL-23p40 (h) were quantified by ELISA (n = 4 ears per condition; 

*P < 0.05; **P < 0.01). ND, not detected. Error bars show mean + s.e.m. 
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signals to modulate local immune responses to a variety of infectious 
and inflammatory stimuli. 


METHODS SUMMARY 


Published mouse strains used in this study are detailed in Methods. Nay1.8-Cre 
mice’ were bred with Rosa26-DTA and Rosa26-TdT mice to generate Na,1.8-DTA 
mice for functional studies and Na,1.8-TdT mice for imaging, respectively. TRPV1~ 
nociceptors were deleted using three escalating doses (30 pgkg ', 70 ugkg' and 
100 pg kg” ') of RTX as described previously"". To induce psoriasiform ear inflam- 
mation, 8-12-week-old mice were treated topically with 5% IMQ cream or injected 
with 500 ng per ear recombinant (r)IL-23. Ear thickness was measured using an 
engineer’s micrometer (Mitutoyo). Cytokines were quantified from skin protein 
extracts by ELISA (Biolegend, R&D). For flow cytometric analysis of tissue leuko- 
cyte markers and intracellular cytokines, single-cell suspensions from ear skin were 
prepared by enzymatic digestion*. Imaging of fixed skin tissue was performed using 
an Olympus Fluoview BX50WI inverted microscope, and MP-IVM in live anaes- 
thetized mice was performed using an upright microscope (Prairie Technologies) 
with a MaiTai Ti:sapphire laser (Spectra-Physics). All animal studies were approved 
by the Institutional Animal Care and Use Committee of Harvard Medical School 
and complied with National Institutes of Health guidelines. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Mice. C57BL/6 mice, 4—8 weeks old, were purchased from Charles River or the Jack- 
son Laboratory and female mice were used in experiments. IL-23R0"’“" mice”, 
in which GFP is knocked into the cytoplasmic tail of IL-23R and the homozygous 
GFP mouse acts as a functional receptor knockout, were provided by M. Oukka 
and both male and female mice were used in experiments. CD11c-YFP mice”! were 
a gift from M. Nussenzweig and both male and female mice were used in experi- 
ments. LTo /~ mice** were purchased from the Jackson Laboratory and male mice 
were used in experiments. CD11c-DTR* mice were purchased from Jackson Labo- 
ratory and both male and female mice were used in experiments. Nay1.8-Cre mice 
were previously described’. Rosa26-DTA mice and Rosa26-TdT mice, which express 
a floxed-STOP cassette upstream of the ubiquitously expressed Rosa26-DTA or 
Rosa26-TdT construct, respectively, were purchased from Jackson Laboratory. 
Na,1.8-Cre male mice were bred with Rosa26-DTA and Rosa26-TdT female mice 
to generate Nay1.8-DTA for functional studies and Nay.1.8-TdT for imaging exper- 
iments, respectively, of which both matched male and female litters were used in 
experiments. Glial fibrillary acidic protein (GFAP)-TdT mice were generated by 
crossing heterozygous GFAP-Cre mice™ purchased from Jackson Laboratory with 
Rosa26-TdT mice. MHC-II-GFP mice* were provided by M. Boes. Bone marrow 
chimaeras were generated by irradiating Na,1.8-TdT mice or GFAP-TdT mice with 
two split doses totalling 1,300 rad and reconstituting with CD11c-YFP or MHC-II- 
GFP unfractionated bone marrow injected intravenously, respectively. Bone mar- 
row chimaeras were allowed to rest for 12 weeks before use. Mice were all housed 
in specific pathogen-free conditions in accordance with the National Institutes of 
Health and all experimental animal protocols were approved by the Institutional 
Animal Care and Use Committee at Harvard Medical School. For most animal 
experiments, because the contralateral ear served as a control for the ear in which 
an inflammatory stimulus was applied, no randomization was used. Investigators 
were blinded for the initial ear-swelling experiments and subsequently no blinding 
was used. 

Denervation. RTX, a capsaicin analogue, was injected subcutaneously into the flank 
of 4-week-old mice in three escalating (30 pg kg ', 70 pg kg ' and 100 p'gkg') doses 
on consecutive days'’. Control mice were treated with vehicle solution (DMSO in 
PBS). Mice were allowed to rest for 4 weeks before denervation was confirmed by 
tail-flick assay. The tail-flick assay was conducted by holding mice vertically in a 
relaxed fashion allowing their tail to be immersed in a temperature-controlled 
water bath maintained at 52 °C. Denervated mice exhibited a tail-flick latency of 
>10 s. Besides insensitivity to noxious heat stimuli, overall behaviour qualitatively 
remained unaltered in RTX-treated mice. For chemical sympathectomy, mice 
were intraperitoneally injected with 80 mg kg” ' 6-OHDA in 0.01% ascorbic acid 
in PBS at day —3. Control mice received injections of 0.01% ascorbic acid in PBS”. 
IMQ and DNEB treatment, IL-23 injection and ear measurement. Mice, 8-12 
weeks old, of indicated genotypes or pharmacological treatments, were treated with 
25 mg of 5% IMQ applied topically to dorsal and ventral aspects of ear skin totalling 
1,250 pig of IMQ per day. Mice were treated starting on day 0 three times and killed 
for analyses on day 3 or were treated six times and killed for analyses on day 6. IL-23 
was injected intradermally into the dorsal aspect of the ear skin as a 50g ml! 
solution in 10 pl PBS (500 ng per ear). Contralateral ears were injected with PBS 
alone. A 0.5% solution of DNFB in acetone was applied to the dorsal and ventral 
aspects of ear skin without prior sensitization. Ear thickness was measured using an 
engineer’s micrometer (Mitutoyo) at the indicated time points and the change in 
ear thickness is calculated as the change in ear thickness from the treated ear rela- 
tive to the contralateral naive or vehicle-treated ear. 

FTY720 treatment and depletion of myeloid cells. FTY720 (1 mgkg_'), an ago- 
nist which results in $1P receptor internalization, or PBS was injected daily intraper- 
itoneally starting from day 0 of IMQ challenge. Efficacy of treatment was confirmed 
by a marked reduction in circulating lymphocytes from peripheral blood (data not 
shown). To deplete dendritic cells from skin, CD11c-DTR mice® were injected at 
day —1 with 4 ng of DTX per g mouse, a dose that is nontoxic to murine cells not 
expressing the DTR”*. Depletion of DDCs and Langerhans cells, but not macro- 
phages, was confirmed by flow cytometry (Extended Data Fig. 8). To deplete neutro- 
phils and inflammatory monocytes, 500 1g of anti-Gr-1 per mouse (clone RB6-8C5; 
BioXCell), was injected intraperitoneally at days — 1,0, 1 and 2. Depletion was con- 
firmed by flow cytometry of peripheral blood as well as challenged ear skin show- 
ing a paucity of neutrophils and inflammatory monocytes (Extended Data Fig. 8). 
Histology. Ears were embedded in paraffin and submitted for histological analysis 
by haematoxylin and eosin staining to the Harvard Rodent Histopathology Core. 
Whole-mount immunofluorescence of ear skin. Ears were harvested and split 
into dorsal and ventral halves. After fixation in 4% paraformaldehyde, any adherent 
cartilage was removed under a dissecting microscope to expose the dermis evenly 
for imaging. Tissue was blocked in blocking buffer containing PBS with 0.5% BSA, 
0.3% Triton X-100, 10% goat serum and Fc Block and also stained with the following 
antibodies in the same buffer. Unconjugated antibodies used include: anti-neuronal 


class III B-tubulin (clone TUJ1; Covance), anti-TH (clone A2B5-105; Millipore), 
anti-Lyve-1 (clone ALY7; eBiosciences), anti-peripherin (Polyclonal; Abcam) and 
anti-NeuN (A60; Millipore). Alexa488-conjugated antibodies include: anti-neuronal 
class III B-tubulin (clone TUJ1; Covance), goat anti-mouse IgG (Invitrogen) and 
goat anti-rabbit IgG. Alexa647-conjugated antibodies include: goat anti-rat IgG and 
goat anti-rabbit IgG. Washing steps were performed in PBS with 0.2% BSA, and 0.1% 
Triton X-100. Ears were mounted with the dermis facing the imaging plane in 
FluorSave reagent (Calbiochem). 

Confocal microscopy. Confocal images were acquired on an Olympus Fluoview 
BX50WI inverted microscope with X10/0.4, *20/0.5 and X40/1.3 magnification/ 
numerical aperture objectives. For images used in three-dimensional analysis, image 
planes were acquired at 0.5 1M intervals through the imaging volume. 

Intravital two-photon microscopy. Anaesthetized mice were placed on a custom- 
built stage and the ear was fixed to a temperature-controlled metallic support to facil- 
itate exposure of the dorsal aspect to the water-immersion X20 objective (0.95 
numerical aperture) of an upright microscope (Prairie Technologies). A MaiTai 
Ti:sapphire laser (Spectra-Physics) was tuned between 870 nm and 900 nm for mul- 
tiphoton excitation and second-harmonic generation. For dynamic analysis of cell 
interaction in four dimensions, several x-y sections (512 X 512) with z spacing rang- 
ing from 2 jm to 4 jum were acquired every 15-20 s with an electronic zoom varying 
from 1X to 3X. Emitted light and second-harmonic signals were directed through 
450/80 nm, 525/50 nm and 630/120 nm bandpass filters and detected with non- 
descanned detectors. Post-acquisition image analysis, volume rendering and four- 
dimensional time-lapse videos were performed using Imaris software (Bitplane 
scientific software). 

Intravital microscopy and image analysis. Intravital microscopy of skin was per- 
formed as previously described”. In brief, control or RTX-treated mice were ana- 
esthetized and the left ear was exposed and positioned for epifluorescence intravital 
microscopy. Preparations were transferred to an intravital microscope (IV-500; 
Mikron Instruments), equipped with a Rapp OptoElectronic SP-20 xenon flash lamp 
system and QImaging Rolera-MGi EMCCD camera. The fluorescent dye rhodamine- 
6G (20 mgkg * in PBS) was administrated through the catheterized right carotid 
artery to visualize circulating leukocytes. Cell behaviour in skin venules was recorded 
in 10 min recordings through X10 or X20 water-immersion objectives (Achroplan; 
Carl Zeiss). Rolling fractions in individual vessel segments were determined offline 
by playback of digital video files. The rolling fraction was determined as the percen- 
tage of cells interacting with skin venules in the total number of cells passing through 
a vessel during the observation period. 

Cytokine quantification by ELISA. Skin biopsies from ears were obtained using 
10mm diameter skin biopsy punches (Acuderm). Samples were homogenized in 
Tissue Extraction Reagent I (Invitrogen) in the presence of protease inhibitor cocktail 
(Roche) using a gentle MACS dissociator (Miltenyi Biotec). Skin protein extracts were 
assayed for IL-17A, IL-22, IL-23p40 (Biolegend) and IL-17F (R&D Systems) in accor- 
dance with the manufacturer’s instructions. 

RNA isolation and qPCR. RNA from sorted and pelleted cells was isolated using 
RNEasy Plus Mini Kit (Qiagen) including a gDNA elimination step. Ear skin and 
dorsal root ganglia (DRGs) were harvested and placed immediately in RNALater 
(Ambion) before homogenization and RNA isolation using Qiagen RNEasy Plus 
Mini Kit (Qiagen). DRGs for comparison of the efficacy of denervation were har- 
vested from equivalent anatomical locations, typically consisting of the cervical and 
thoracic ganglia from C1-T2. cDNA synthesis was done using Superscript VilocDNA 
synthesis kit (Invitrogen) following the manufacturer’s instructions. Relative quan- 
tification of transcripts was done using validated Quantitect Primer Assays (Qiagen) 
combined with the QuantiTect SYBR Green Detection Kit (Qiagen) on a LightCycler 
480 (Roche). Relative expression of genes was calculated to Gapdh using the ACy 
method. 

Tissue digestion. Single-cell suspensions of lymph nodes, spleen and bone mar- 
row were generated as described previously’. For ear skin digestion, the dorsal and 
ventral aspects of the ear were mechanically separated before mincing and placing 
into a digestion mix modified from that described previously*. Ears were digested 
for 80 min at 37 °C in gentleMACS tubes (Miltenyi) with gentle agitation in freshly 
prepared digestion mix consisting of DMEM (Gibco) supplemented with HEPES 
(Invitrogen), 2% FCS, 100 jig ml 1 Liberase TM (Roche), 100 ug ml” 1 DNase I (Roche) 
and 0.5 mg ml | Hyaluronidase (Sigma). After enzymatic digestion, the mixture 
was processed using a gentle MACS homogenizer (Miltenyi) to obtain a cell sus- 
pension, which was then filtered through a 70 [1M cell strainer (BD). Cells were 
then resuspended in FACS buffer for analysis. If cell suspensions were to be ana- 
lysed for cytokine-producing cells, ears were harvested and digested in the pre- 
sence of Brefeldin A (Biolegend). 

Flow cytometry, cell sorting and cell counts. Single-cell suspensions in FACS 
Buffer (PBS with 2 mM EDTA and 2% FCS (Invitrogen-GIBCO)) were pre-incubated 
with Fc Block (clone 2.4G2) before staining for surface antigens. FITC-conjugated anti- 
bodies used include: anti-Ly-6G (clone 1A8; BD Pharmingen). Alexa488-conjugated 
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antibodies used include: anti-CD103 (clone 2E7; Biolegend). PE-conjugated anti- 
bodies used include: anti-6-TCR (clone GL3; Biolegend), anti-Ly-6C (clone HK1.4; 
Biolegend). PerCP/Cy5.5-conjugated antibodies used include: anti-CD45.2 (clone 
104; Biolegend). PE-Cy7-conjugated antibodies include: anti-TCR-B (clone H57- 
597; Biolegend), anti-CD11c (clone HL3; BD Pharmingen). Alexa647-conjugated 
antibodies used include: anti-CCR6 (clone 140706; BD Pharmingen), anti-CD11b 
(clone M1/70; Biolegend). APC-Cy7-conjugated antibodies used include: anti-Thy1.2/ 
CD90.2 (clone 30-H12; Biolegend), anti-I-A/I-E ‘Class-IP (clone M5/114.15.2; Bio- 
legend). Cells were then washed with PBS and resuspended in MACS buffer for 
immediate acquisition or fixed in Cytofix (BD Pharmingen) as per the manufac- 
turer’s instructions for later acquisition. For analysis, cells were acquired on a BD 
FACS CANTO (BD Pharmingen) and analysed using FlowJo software (Treestar). 
For intracellular cytokine staining, cells were not restimulated and rather harvested 
and digested in the presence of Brefeldin A (Biolegend). After staining for surface 
antigens, cells were then fixed and permeabilized using BD Cytofix/Cytoperm kit 
(BD Pharmingen) as per the manufacturer’s instructions. Cells were stained in Perm/ 
Wash buffer. Alexa488-conjugated antibodies used include: anti-IL-17A (clone TC11- 
18H10.1; Biolegend), anti-IL-17F (clone 9D3.1C8; Biolegend). Alexa647-conjugated 
antibodies used include: anti-IL-17A (clone TC11-18H10.1; Biolegend), anti-IL-17F 
(clone 9D3.1C8; Biolegend) and anti-IL-22 (clone Poly5164; Biolegend). For deter- 
mining total counts of cell subsets, an aliquot of the same cell suspension used for 
flow cytometry was stained for CD45.2 and acquired on an Accuri Cytometer (BD 
Biosciences) with a known amount of CountBright counting beads (Invitrogen). 
The total CD45* cell number was then determined in the original cell suspension 
and used for total quantification of the cell number of each subset of interest defined 
by multi-parameter flow cytometry based on that subset’s frequency relative to the 
CD45* population. For cell sorting, cells were stained for indicated surface markers 
and sorted using a BD FACSAria (BD Biosciences) into complete DMEM media 
before RNA extraction. 

Image analysis. Images from confocal microscopy and from two-photon micro- 
scopy were analysed on either Volocity software (Improvision) or Imaris (Bitplane). 
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For determining the distance of CD11c-YFP cells from nerves, lymphatics or blood 
vessels, the centroid of dendritic cells was calculated by an unbiased approach, and 
the distance between a given dendritic cell centroid and the closest nerve, blood 
vessel or lymphatic vessel was calculated using Imaris software. To bin cells into con- 
tact (<0 jum), proximal (0-7 um) and distal (>7 jum) fractions, the calculated aver- 
age radius of a dendritic cell (7.04 um) was subtracted from each measured distance. 
Statistical analyses. Precise experimental numbers of animals are reported in the 
figure legends. Experiments were repeated at least three times except in Figs 2e and 
3e-g in which two replicates were done assessing ten mice total in each experimen- 
tal group overall. Some data, such as ear-swelling curves, represent pooled averages 
of the sum total of animals used in experiments whereas other data consist of a rep- 
resentative experiment of the independent experiments. All statistical analyses were 
performed using Prism (GraphPad Software) and results are calculated as means 
with error bars representing the s.e.m. Means between two groups were compared 
by using a two-tailed t-test. Means between three or more groups were compared by 
using a one-way or two-way ANOVA. A Chi-squared statistical analysis was per- 
formed for Fig. 4d comparing the total number of dendritic cells in contact, prox- 
imal and distal bins relative to nerves versus lymphatic vessels or versus blood vessels. 
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Extended Data Figure 1 | 6(OHDA treatment ablates sympathetic nerve (ascorbic acid)-treated and sympathectomized (60HDA) mice. c-i, Analysis 
function and reduces ear swelling, but does not ameliorate the of the inflammatory response in ears of vehicle (ascorbic acid)-treated and 
inflammatory response to IMQ treatment. a, The experimental protocol was — sympathectomized (60HDA) mice after daily topical IMQ challenge: 
as follows. Mice were injected intraperitoneally with 6OHDA, resulting in a timecourse of change in ear thickness of IMQ-treated ear relative to the 
reversible chemical sympathectomy lasting for approximately 2 weeks. contralateral ear (n = 10; *P < 0.05; **P < 0.01) (c) and total number of 
After a rest period of 3 days animals were challenged topically on the ear infiltrating monocytes (d) and neutrophils (e), and the amount of IL-17A (f), 
with IMQ. b, Representative section of splenic white pulp showing B cells IL-17F (g), IL-22 (h) and IL-23p40 (i) in protein extracts of IMQ exposed ears 
(B220, white), T cells (CD3, red), and TH* (green) nerve fibres in vehicle at day 3 (*P < 0.05; n = 5). NS, not significant. 
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Extended Data Figure 2 | RTX treatment diminishes noxious heat sensation 
and decreases the expression of nociceptor markers on dorsal root ganglia. 
a, Schematic protocol of nociceptor ablation and induction of psoriasiform skin 
inflammation. RTX was injected subcutaneously into the back in three 
escalating doses (30 ugkg” ', 70 pg kg’ and 100 ug kg’) on consecutive days 
and mice were allowed to rest for at least 4 weeks before IMQ treatment. 
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b, Denervation was confirmed by immersing the tail of mice into a 
temperature-controlled water bath maintained at 52 °C and the latency to the 
first tail movement to avoid water was measured (n = 6). c, Total RNA was 
isolated from dorsal root ganglia (level C1-C7) of vehicle (DMSO)- and 
RTX-treated mice and the levels of Trpv1, Scn10a (Na,1.8), Tac1 (substance P), 
Mrgprd, Trpm8 and Trpal mRNA relative to Gapdh were determined (n = 3). 
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Extended Data Figure 3 | Gating strategy for myeloid cells and T-cell inflammatory monocytes and neutrophils based on Ly-6C and Ly-6G staining. 
subsets from digested ear skin. a, The ear skin of mice challenged for 3 days __ b, The ear skin of naive mice was digested as described in Methods and, 

with IMQ was digested as described in Methods and, after doublet exclusion after doublet exclusion and gating on defined FSC-A, SSC-A parameters, 

and gating on defined FSC-A, SSC-A parameters, infiltrating myeloid cells were cutaneous T cells were gated on CD45", Thy1 *, and then divided into subsets 
gated as CD45" I-Ab (Class-II)~, CD11b* CD11c_, and then subdivided into _ based on staining for 5-TCR and B-TCR. 
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Extended Data Figure 4 | RTX treatment reduces the immune cell infiltrate 
upon IMQ treatment in the skin but does not affect reservoirs of 
inflammatory monocytes and neutrophils at steady state. a, The ear skin of 
vehicle (DMSO) or sensory denervated (RTX) mice was treated with topical 
IMQ cream daily and the total numbers of CD45* cells were determined on 
day 3 as explained in Methods (n = 10). b, Representative histological sections 
of untreated and IMQ-treated ears at day 3 stained by haematoxylin and eosin 
(X20) (n = 5 per condition). c, d, Total inflammatory monocytes (CD45*, 
CD11b‘, Ly-6C"") and neutrophils (CD45*, CD11b*, Ly-6Gh8") were 
determined by flow cytometry (n = 5-10 mice per time point). Two-way 
ANOVA was run to compare total numbers of inflammatory monocytes and 
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neutrophils between DMSO and RTX conditions over days 3-6 

(****P < 0.0001). One-way ANOVA was run to compare total inflammatory 
monocytes and neutrophil numbers over days 3-6 within DMSO or RTX 
conditions (**P < 0.003). e, Bone marrow was isolated from wild-type and 
RTX mice from one femur and the frequency of inflammatory monocytes 
(CD45*, CD11b*, Ly-6C™®", Ly-6G_) and neutrophils (CD45*, CD11b*, 
Ly-6C™4, Ly-6G*) relative to CD45* cells was determined by flow cytometry 
(n = 5). f, Spleens from wild-type and RTX mice were processed for flow 
cytometry and the frequency of inflammatory monocytes (CD45*, CD11b*, 
Ly-6C'®", Ly-6G_ ) and neutrophils (CD45*, CD11b*, Ly-6C™4, Ly-6G*) 
relative to CD45" cells was determined (n = 5). NS, not significant. 
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Extended Data Figure 5 | Leukocyte rolling fractions in skin venules of 
control and RTX-treated mice analysed by intravital microscopy. Combined 
results are shown for 26 venules from 5 control mice and for 20 venules from 4 
RTX-treated mice. Data are expressed as mean = s.e.m. of four experiments. 
NS, not significant. 
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Extended Data Figure 6 | Dermal 76 T cells represent a major source of 
IL-17F and IL-22 in skin during IMQ challenge and already express IL-23R 
at steady state. a, Wild-type mice were challenged with IMQ and the total 
numbers of IL17F* dermal 7 T cells and af T cells at 3 days (n = 15) or 6 days 
(n = 10) were quantified. b, Representative flow plots (related to those depicted 
in Fig. 2g) of gating for IL-22* cells within dermal 78 T cells after 6 days of 
IMQ treatment. c, Ears of DMSO- or RTX-treated mice were exposed for 6 days 
to IMQ and the frequency of IL-17F* and IL-22* cells within a T cells was 
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determined (n = 5). d, Auricular lymph node (aLN) cells from IL-23RGPP/* 
mice were analysed by flow cytometry for expression of IL-23R-GFP* cells 
within the 6 T cells and «f T cell compartment at steady state (representative 
FACS plot from eight mice analysed). e, The ear skin from IL-23R“"’* mice 
was digested and analysed by flow cytometry and the distribution of T-cells 
subsets within IL-23R-GFP™ and IL-23R-GFP fractions of Thy1* cells 
determined (representative FACS plot from eight mice analysed). NS, not 
significant. 
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Extended Data Figure 7 | TRPV1* nociceptors regulate IMQ- and 

DNEB- induced dermatitis and are upstream of IL-23. a-c, After 3 days 
of IMQ challenge, ears were harvested and processed for total RNA isolation 
and [112b (a), [123a (b) and Il12a (c) mRNA levels were analysed by qPCR 

(n = 5). d, DNFB (0.5% in acetone) was applied topically to DMSO and RTX 
mice. Time course of change in ear thickness of IMQ-treated ear relative to 
the contralateral ear is represented (n = 10). Two-way ANOVA was run to 
compare ear swelling under DMSO and RTX conditions over time 

(****P < 0.0001). e, Representative FACS plots from ears harvested after 24h 
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of DNEB application. f, IL-23R9""/SF mice were treated with RTX and then 
compared to wild-type and IL-23R“?’?* littermate controls during IMQ 
treatment. Ear thickness was calculated relative to the contralateral ear (n = 5). 
g, After two IL-23 injections into the ear skin of wild-type and IL-23R0P/4F? 
mice, the frequency of IL-17F" cells within dermal 76 T cells was determined 
by flow cytometry (n = 5). h, IL-23 was injected twice into the ear skin of 
vehicle- and RTX-treated mice and the total numbers of IL17A~ or IL-17F* 
dermal yé T cells per ear were determined by flow cytometry (n = 5). 

NS, not significant. 
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Extended Data Figure 8 | Selective depletion of migratory and skin-resident 
myeloid cell subsets in ear skin and gating strategy used for sorting to isolate 
RNA from MHC-II* cells in skin. a, Wild-type mice were treated with 
anti-Gr-1 (clone RB6-8C5 to deplete neutrophils and inflammatory 
monocytes) or matched isotype control, challenged with IMQ for 3 days and 
skin was digested to quantify the total numbers of inflammatory monocytes 
and neutrophils per ear. Shown are representative plots pre-gated on CD45* 
cells and quantification of cell numbers from n = 3 mice. b, DTX treatment 
resulted in depletion of both subsets of DDCs as well as Langerhans cells (LCs) 


but not macrophages. Cells were gated as shown in Extended Data Fig. 8c and 
normalized to levels in wild-type mice based on the frequency within the 
CD45* population from n = 4 mice. c, Ear skin from naive mice was digested 
and analysed by flow cytometry for the indicated subsets. Shown is a 
representative plot pre-gated on CD45* Class II” cells from which further 
subsets were divided based on CD11b and CD11c expression and then F4/80 
and CD103 as indicated. d, Total RNA from sorted cells was isolated and qPCR 
for Il12a relative to Gapdh was performed from naive and IMQ-treated ears 
after 6h from n = 20 pooled mice. 
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Extended Data Figure 9 | DDCs are found in close apposition to Na,1.8* 
nociceptors and characterization of RTX-treated and Na,1.8-DTA mice. 

a, Representative confocal micrographs of CD11c-YFP mice stained for 
B3-tubulin, Lyve-1 (lymphatics) and CD31 (blood and lymphatic endothelial 
cells). b, Three-dimensional quantification of DDC proximity to peripheral 
nerves in naive and 6h post-IMQ treatment ears binned into contact (<0 im), 
proximal (0-7 um) and distal (>7 jum) fractions as explained in Methods (n of 
dendritic cells = 200). c, Total RNA from dorsal root ganglia (C1-C4) of 
littermate control and Na,1.8-DTA mice was isolated and levels of mRNA for 
Trpv1 (TRPV1), Scn10a (Na,1.8), Tac1 (substance P) and Trpa1 (TRPA1) were 
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determined relative to Gapdh. This demonstrates the efficacy of the Nay1.8- 
DTA system and combined with the original reference characterizing the pain 
phenotype of these mice illustrates that a subset of peptidergic TRPV1~ nerve 
fibres is spared. d, Representative confocal micrograph of whole-mount ear 
skin of vehicle- and RTX-treated mice showing preserved nerve scaffold. 

e, Representative confocal micrographs of whole-mount ear skin of control and 
Na,1.8-DTA mice showing preserved nerve scaffold. Although dorsal root 
ganglia showed a loss of the hallmark ion channels of these nerve subsets 
(Extended Data Figs 1c, 9c), surprisingly we still observed that RTX mice and 
Na,1.8-DTA mice maintain a meshwork of nerves in the skin. 
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Ubiquitin is phosphorylated by PINK] to 


activate parkin 


Fumika Koyano'’, Kei Okatsu’?, Hidetaka Kosako?, Yasushi Tamura‘, Etsu Gol, Mayumi Kimura’, Yoko Kimura!®, 
Hikaru Tsuchiya’, Hidehito Yoshihara’, Takatsugu Hirokawa’®, Toshiya Endo”’®, Edward A. Fon’, Jean-Francois Trempe””, 


Yasushi Saeki’, Keiji Tanaka’ & Noriyuki Matsuda! 


PINK1 (PTEN induced putative kinase 1) and PARKIN (also known 
as PARK2) have been identified as the causal genes responsible for 
hereditary recessive early-onset Parkinsonism’. PINK] isa Ser/Thr 
kinase that specifically accumulates on depolarized mitochondria, 
whereas parkin is an E3 ubiquitin ligase that catalyses ubiquitin trans- 
fer to mitochondrial substrates*°. PINK1 acts as an upstream factor 
for parkin®” and is essential both for the activation of latent E3 parkin 
activity® and for recruiting parkin onto depolarized mitochondria*”. 
Recently, mechanistic insights into mitochondrial quality control medi- 
ated by PINK] and parkin have been revealed**, and PINK1-dependent 
phosphorylation of parkin has been reported’* '°. However, the require- 
ment of PINK1 for parkin activation was not bypassed by phospho- 
mimetic parkin mutation’’, and how PINK] accelerates the E3 activity 
of parkin on damaged mitochondria is still obscure. Here we report 
that ubiquitin is the genuine substrate of PINK1. PINK1 phosphor- 
ylated ubiquitin at Ser 65 both in vitro and in cells, and a Ser 65 phos- 
phopeptide derived from endogenous ubiquitin was only detected in 
cells in the presence of PINK1 and following a decrease in mitochon- 
drial membrane potential. Unexpectedly, phosphomimetic ubiquitin 
bypassed PINK1-dependent activation of a phosphomimetic parkin 
mutant in cells. Furthermore, phosphomimetic ubiquitin accelerates 
discharge of the thioester conjugate formed by UBCH7 (also known as 
UBE2L3) and ubiquitin (UBCH7~ubiquitin) in the presence of parkin 
in vitro, indicating that it acts allosterically. The phosphorylation- 
dependent interaction between ubiquitin and parkin suggests that 
phosphorylated ubiquitin unlocks autoinhibition of the catalytic 
cysteine. Our results show that PINK1-dependent phosphorylation 
of both parkin and ubiquitin is sufficient for full activation of parkin 
E3 activity. These findings demonstrate that phosphorylated ubiqui- 
tin is a parkin activator. 

Dysfunction of PINK] or parkin seems to cause an accumulation of 
low-quality depolarized mitochondria and production of excessive reac- 
tive oxygen species (ROS), which trigger familial Parkinsonism*>. One 
of the most poorly understood events in PINK1- and parkin-mediated 
mitochondrial quality control is how the E3 activity of parkin is accel- 
erated by damaged mitochondria. The simplest model is that PINK1- 
dependent phosphorylation of parkin accelerates its enzymatic activity. 
Indeed, parkin is phosphorylated at Ser 65 ina PINK1-dependent manner’. 
Furthermore, a phosphorylation-deficient mutation (Ser65Ala) of par- 
kin hindered formation of the ubiquitin-ester intermediate necessary 
for parkin activation’. However, the requirement of PINK] for parkin 
activation was not bypassed by the phosphomimetic parkin (Ser65Glu) 
(the mutation did not make PINK1 dispensable). We thus speculated 
that parkin is not the sole PINK1 substrate and that phosphorylation of 
other PINK1 substrate(s) is imperative for parkin activation. 


The structure of the amino-terminal parkin UBL (ubiquitin-like) domain 
resembles that of ubiquitin, and Ser 65 is conserved in both proteins 
(Extended Data Fig. 1a). Moreover, physical interactions between parkin 
and ubiquitin have been suggested'*’’. We therefore examined whether 
ubiquitin is phosphorylated by PINK1. To detect a phosphorylated 
ubiquitin, we performed phosphate-affinity (Phos-tag) PAGE in which 
the phosphorylated ubiquitin can be easily distinguished from the 
non-phosphorylated form as a slower migrating band'*. When PINK1- 
expressing HeLa-cell lysates treated with the protonophore carbonyl 
cyanide 3-chlorophenylhydrazone (CCCP) were subjected to Phos-tag 
PAGE, one retarded-mobility form of ubiquitin was specifically observed 
in the CCCP-treated cell lysate (Fig. 1a). When recombinant ubiquitin 
and ubiquitin with small N-terminal epitope tags were incubated with 
isolated mitochondria from CCCP-treated or untreated cells, and then 
subjected to Phos-tag PAGE (Fig. 1b and Extended Data Fig. 1b), ubi- 
quitin phosphorylation was specifically observed in the reaction con- 
taining CCCP-pretreated mitochondria (Fig. 1b, lanes 2, 4, 6). In contrast, 
the small ubiquitin-related modifier-1 (SUMO-1) protein was not phos- 
phorylated by depolarized mitochondria (Extended Data Fig. 1c), sug- 
gesting that phosphorylation of ubiquitin was specific. 

To determine the phosphorylation site, the phosphorylated ubiquitin 
was trypsinized and subjected to liquid chromatography tandem mass 
spectrometry (LC-MS/MS) analysis. Two phosphorylated ubiquitin pep- 
tides, 55-72 (TLSDYNIQKEpSTLHLVLR) and 64-72 (EpSTLHLVLR) 
were identified with high confidence, suggesting Ser 65 phosphorylation 
(Extended Data Fig. 1d). Indeed, mutation of Ser 65 prevents ubiquitin 
phosphorylation by damaged mitochondria in vitro (Fig. lcand Extended 
Data Fig. le) and following protonophore treatment in cells (Fig. 1d and 
Extended Data Fig. 1f). Next, the absolute levels of phosphorylated ubi- 
quitin of PINK1-expressing HeLa cell lysates (Extended Data Fig. 1g, h) 
were determined using absolute quantification (AQUA) peptides as stan- 
dards (Extended Data Table 1). The non-phosphorylated ESTLHLVLR 
peptide derived from endogenous ubiquitin was detected both in the pres- 
ence and absence of CCCP, whereas the Ser 65 phosphopeptide (EpSTLH 
LVLR) was only detected in CCCP-pretreated cells (Fig. le and Extended 
Data Fig. 1i). Absolute quantification revealed that approximately 3% 
of total endogenous ubiquitin is phosphorylated in PINK1-expressing 
cells following a decrease in mitochondrial membrane potential (A‘¥m). 

We next investigated whether PINK] is required for ubiquitin phosphor- 
ylation. When mitochondria derived from PINK1-knockout (PINK1 =“) 
mouse embryonic fibroblasts (MEFs)’’ were used (Fig. 2a and Extended 
Data Fig. 2a), ubiquitin phosphorylation was completely impeded even 
following CCCP pretreatment (Fig. 2a, lane 2). Wild-type PINK] acti- 
vated by CCCP treatment”’”’ complemented the phosphorylation of 
ubiquitin (Fig. 2a, lane 4), whereas PINK] mutants with decreased kinase 
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Figure 1 | Ubiquitin Ser 65 is phosphorylated when A¥m is decreased. 

a, HeLa cells stably expressing PINK1 were treated with protonophore (CCCP) 
followed by Phos-tag PAGE and immunoblot analysis with an anti-ubiquitin 
(Ub) antibody. Red asterisk, phosphorylated ubiquitin. Ub,, ubiquitinated 
proteins. b, Recombinant ubiquitin was phosphorylated by depolarized 
mitochondria prepared from PINK1-expressing HeLa cells. Black asterisk, 
cross-reacting band; HA, haemagglutinin; His,, tandem repeat (x6) of His 
residues; Phos-Ub, phosphorylated ubiquitin; Non-tag Ub, non-tagged 
ubiquitin. c, d, Mutation of Ser 65 prevents ubiquitin phosphorylation by 
depolarized mitochondria in vitro (c) and following protonophore treatment in 
cells (d). e, Absolute quantification of ESTLHLVLR and EpSTLHLVLR 
peptides in low-molecular-weight fraction of PINK1-expressing cells. Error bars 
represent mean + s.e.m. from three experiments. Statistical significance was 
calculated using a one-tailed paired t-test (significant if P < 0.05). WT, wild type. 


activity (kinase-dead, Ala168Pro, and Gly386Ala)” did not support phos- 
phorylation of ubiquitin (Fig. 2a, lanes 6, 8, 10). To demonstrate PINK1- 
catalysed ubiquitin phosphorylation more convincingly, immunoprecipitated 
PINK] (Extended Data Fig. 2b) was tested in vitro. Ubiquitin phosphor- 
ylation following incubation with the immunoprecipitated PINK1 was 
equivalent to that of CCCP-pretreated mitochondria (Fig. 2b and Extended 
Data Fig. 2c). Moreover, when recombinant ubiquitin was incubated 
with immunoprecipitated PINK] and [y-**P]ATP, incorporation of **P 
was detected in the recombinant wild-type ubiquitin (Fig. 2c, lane 2) 
but not the Ser65Ala ubiquitin mutant (Fig. 2c, lane 4). Taken together, 
these results demonstrate that PINK1 can phosphorylate ubiquitin at 
Ser 65. Because the data presented above were derived from PINK1- 
overproducing cells, we next quantified ubiquitin phosphorylation in 
intact HeLa cells (Fig. 2d and Extended Data Fig. 3) and found that 0.05% 
of total endogenous ubiquitin is phosphorylated. Although the absolute 
level was very low, given the pleiotropic functions of ubiquitin, such a 
small fraction might be a significant pool dedicated to the PINK1-mediated 
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Figure 2 | PINK1 phosphorylates ubiquitin. a, Active PINK1 is essential for 
ubiquitin phosphorylation in vitro. Mitochondria isolated from PINK1- 
knockout (PINKI~’~) mouse embryonic fibroblasts (MEFs) transfected with 
wild-type and mutant PINK] were used. b, Immunoprecipitated PINK1 
(IP-PINK1) can phosphorylate ubiquitin in vitro to the same level as 
depolarized mitochondria. c, Ubiquitin was labelled with **P by IP-PINK1 
prepared from CCCP-treated mitochondria, whereas the ubiquitin(Ser65Ala) 
mutant was not. Asterisks, heavy and light chains of immunoglobulin. 

d, Absolute quantification of endogenous phosphorylated ubiquitin in intact 
HeLa cells. Statistical analysis was performed as in Fig. le. e, Precursor 

(M + 3H)?" spectrum of the peptide harbouring phosphorylated Ser 65. 

The mass spectrometry peaks are shown in blue, and the orange bar indicates 
the position of the precursor. The theoretical position of the heavy precursor 
peptide is shown in transparent red. 


activation of parkin. We tested further the dependency on endogenous 
PINK1. HeLa cells grown in light or heavy SILAC (stable isotope labeling 
using amino acids in cell culture) media were treated with non-target 
or PINK] siRNA, respectively (Extended Data Fig. 2d), and then sub- 
jected to LC-MS/MS analysis. Phosphorylated Ser 65 peptide was pre- 
sent in control siRNA cells (Fig. 2e, peak identified as ‘light’) but absent 
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in PINK1 siRNA (siPINK1) cells (peak identified as ‘heavy’), indicating 
endogenous PINK1 phosphorylated ubiquitin at Ser 65 in intact cells. 
We next investigated the role of phosphorylated ubiquitin. If ubiquitin 
is the genuine PINK1 substrate, a phosphomimetic ubiquitin mutant 
would be expected to bypass the PINK1-dependency of parkin activation® 
or parkin recruitment onto depolarized mitochondria* ’. To monitor 
the E3 activity of parkin, autoubiquitination of green fluorescent pro- 
tein (GFP)-tagged parkin (GFP-parkin) was used as an index®. In our 
first trial, expression of phosphomimetic ubiquitin(Ser65Asp) mutants 
led to neither mitochondrial localization (Fig. 3a, b) nor autoubiquiti- 
nation of GFP-parkin (Fig. 3c, lane3). However, the CCCP treatment 
control triggered both mitochondrial localization (not shown) and auto- 
ubiquitination (Fig. 3c, lanes 4-6) of GFP-parkin. Because PINK] also 
phosphorylates parkin’*"’, we used a phosphomimetic parkin mutant, 
parkin(Ser65Glu)”, in case both phosphorylation events are necessary. 
Cytoplasmic localization of GFP-parkin(Ser65Glu) remained unchanged 
following co-expression with the ubiquitin(Ser65Asp) (Fig. 3d, e), whereas 
CCCP treatment stimulated translocation of GFP-parkin(Ser65Glu) to 
mitochondria (not shown). In sharp contrast, phosphomimetic ubiquitin 
(Ser65Asp) localized in cytoplasm (Extended Data Fig. 4a) promoted 
autoubiquitination of GFP—parkin(Ser65Glu) even in the absence of CCCP 
treatment (Fig. 3f, lane 3). Moreover, overproduction of phosphorylation- 
deficient ubiquitin(Ser65Ala) delayed CCCP-triggered parkin activation 
as compared with phosphorylatable wild-type ubiquitin, suggesting that 
the unphosphorylatable ubiquitin mutant competed with endogenous 
ubiquitin and thus exerted a dominant-negative effect (Fig. 3g). 
Recently, it was noted that the parkin Trp403Ala mutation wea- 
kened autoinhibition and accelerated E3 activity”. When GFP-parkin 
(Trp403Ala) was co-expressed with ubiquitin(Ser65Asp), its autoubiqui- 
tination was also observed even without CCCP treatment (Fig. 3h, lane 3). 
These results suggest phosphorylated ubiquitin functions as a parkin acti- 
vator; otherwise parkin might preferentially use phosphorylated ubiqui- 
tin for conjugation. To investigate this, we mutated the carboxy-terminal 
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diglycine motif of ubiquitin(Ser65Asp), which is crucial for formation 
of the thioester intermediate during the ubiquitination cascade. As shown 
in Fig. 3i, phosphomimetic ubiquitin with the diglycine mutations 
(Gly75Val/Gly76Val or Gly75Ala/Gly76Ala) still stimulated the E3 
activity of GFP-parkin(Ser65Glu) and GFP-parkin(Trp403Ala) (Fig. 3i, 
lanes 5, 6, 11, 12; endogenous ubiquitin was conjugated). Moreover, parkin 
catalyses conjugation of phosphorylation-deficient ubiquitin(Ser65Ala) 
to depolarized mitochondria following CCCP treatment (Fig. 3j, k), indi- 
cating that parkin does not exclusively use phosphorylated ubiquitin for 
ubiquitination. Nevertheless, phosphomimetic ubiquitin could be recruited 
to mitochondria and conjugated to mitofusin-2 (Extended Data Fig. 4b, c) 
in a CCCP-dependent manner. Thus, parkin conjugates both unpho- 
sphorylated and phosphorylated ubiquitin to substrates. 

We confirmed further the aforementioned results in a cell-free assay 
using GFP-parkin and purified mitochondria’*”*. Autoubiquitination 
of GFP-parkin in intact cell extracts was specifically observed when incu- 
bated with CCCP-pretreated mitochondria’*”*. In contrast, autoubiqui- 
tination of GFP-parkin mutants harbouring the Ser65Glu or Trp403Ala 
mutations was clearly observed when incubated with recombinant 
phosphomimetic ubiquitin(Ser65Asp) in the absence of depolarized mito- 
chondria (Fig. 4a, lanes 12, 18). Wild-type ubiquitin and the phosphor- 
ylation-deficient Ser65Ala mutant did not support autoubiquitination 
of parkin(Ser65Glu) or parkin(Trp403Ala) (lanes 7-10, 13-16), and 
ubiquitin(Ser65Asp) did not accelerate autoubiquitination of wild-type 
GFP-parkin (lane 6). To rule out the possible effect of PINK1 contam- 
ination, we repeated the experiments using GFP-parkin prepared from 
PINKI '~ MEFs. Autoubiquitination of both Ser65Glu and Trp403 Ala 
mutants of GFP-parkin were again observed (Fig. 4b), further support- 
ing the idea that phosphomimetic ubiquitin makes PINK1 dispensable 
for phosphomimetic parkin activation. 

We next examined directly whether phosphorylated ubiquitin (not 
phosphomimetic ubiquitin) activated parkin. Ubiquitin was phosphor- 
ylated by incubation with depolarized mitochondria, then subjected to 
Figure 3 | Phosphomimetic ubiquitin activates 
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f, Autoubiquitination of phosphomimetic 
GFP-parkin(Ser65Glu) was observed following 
co-expression of phosphomimetic 
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CCCP treatment (lane 3). g, Co-expression of 
phosphorylation-deficient ubiquitin(Ser65Ala) 
mutant delayed CCCP-dependent activation 
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ultracentrifugation to deplete mitochondria, and finally treated at 90 °C 
for 10 min. Ubiquitin is a rare heat-stable protein and resistant to 90 °C 
treatment”, whereas any contaminating PINK1 would be inactivated 
under these conditions (Extended Data Fig. 5). The phosphorylated 
ubiquitin induced clear autoubiquitination of GFP-parkin(Ser65Glu) 
and GFP-parkin(Trp403Ala) (Fig. 4c, lanes 4 and 6), indicating that 
phosphorylated ubiquitin activates parkin in vitro. Similarly, when ubi- 
quitin harboring the Gly75Ala/Gly76Ala mutation was phosphorylated 
beforehand and incubated with a GFP-parkin(Ser65Glu)-containing 
lysate, autoubiquitination of GFP—parkin(Ser65Glu) was observed (Fig. 4d, 
lane 6; note that the conjugation-competent endogenous ubiquitin is 
supplied in the parkin-containing lysate), confirming that phosphory- 
lated ubiquitin activates parkin(Ser65Glu) independent of its conjuga- 
tion (Fig. 3i), whereas non-phosphorylated ubiquitin(Gly75Ala/Gly76Ala) 
does not promote autoubiquitination (Fig. 4d, lane 4). 

To confirm the dependency of parkin activation on ubiquitin phos- 
phorylation, we used a yeast system in which all of the genomic-ubiquitin 
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genes are disrupted, the fused ribosomal proteins (L40 and $31) are com- 
plemented, and ubiquitin is expressed from a plasmid”. We introduced 
the Ser65Ala mutation into the yeast ubiquitin plasmid and co-expressed 
it with PINK] and GFP-parkin (note that yeast lack PINK] and parkin 
homologues). GFP-parkin in yeast cells harbouring wild-type ubiqui- 
tin underwent autoubiquitination when co-expressed with wild-type 
PINK1, but lacked the modification when expressed with a kinase-dead 
PINK1 mutation or pathogenic parkin mutations (Extended Data Fig. 6). 
In contrast, even when co-expressed with wild-type PINK1, GFP-parkin 
did not show this modification in yeast only expressing the mutant 
ubiquitin(Ser65Ala) (Fig. 4e). Taken together, the results obtained using 
this stringent heterologous cellular system argue strongly that ubiquitin 
phosphorylation at Ser 65 is indeed essential for parkin activation. 
Finally, we sought to define the molecular mechanism by which phos- 
phorylated ubiquitin activates parkin. We confirmed that phosphory- 
lated ubiquitin was charged on UBCH7 as readily as non-phosphorylated 
ubiquitin (Fig. 4fand Extended Data Fig. 7a), indicating a similar efficiency 
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Figure 4 | Molecular mechanism underlying parkin activation by 
phosphorylated ubiquitin. a, b, Recombinant phosphomimetic 
ubiquitin(Ser65Asp) activates GFP-parkin(Ser65Glu) and GFP- 
parkin(Trp403Ala) mutants prepared from Hela cells (a) and PINKI ‘~ 
MEFs (b). c, Recombinant phosphorylated ubiquitin activated Ser65Glu and 
Trp403Ala GFP-parkin from mitochondria-intact cells. d, Phosphorylated 
ubiquitin(Gly75Ala/Gly76Ala) that cannot be conjugated still promoted 
autoubiquitination of parkin(Ser65Glu). Non-Phos-Ub, non-phosphorylated 
ubiquitin; Phos-Ub, phosphorylated ubiquitin. e, Mitochondrial-localized 
PINK1 in yeast cells harbouring WT ubiquitin activates parkin, but parkin is 
not activated when ubiquitin is replaced with the Ser65Ala mutant. Red 
asterisks and red vertical bars in a—e indicate autoubiquitination of GFP- 
parkin. f, UBCH7 can be charged with phosphorylated Ub, as seen in the shift 
observed in UBCH7~Phos-Ub (lane 4, red asterisk indicates thioester-linked 
ubiquitination), or with ubiquitin(Ser65Asp) (lanes 7, 8). CBB, Coomassie 
Brilliant Blue; DTT, dithiothreitol. g, phosphomimetic and phosphorylated 


ubiquitin accelerates discharging of UBCH7~ubiquitin in the presence of 
catalytically active parkin in vitro. Error bars represent the mean + s.d. values 
in three experiments, and statistical significance was calculated using a 
one-tailed paired t-test. NS, not significant (P > 0.05). h, FluoPPI assay revealed 
a physical interaction between phosphomimetic parkin and phosphomimetic 
ubiquitin. A Cys431Ala mutation of parkin anda Gly75Ala/Gly76Ala mutation 
of ubiquitin prevented a ubiquitination-derived pseudo-positive signal. 
hAG-Ub, hAG (homo tetramer Azami-GFP)-tagged ubiquitin; Ash-parkin, 
Ash (homo-oligomerized protein assembly helper)-tagged parkin. Scale bars 
indicate 10 im. i, Foci-containing cells were calculated in 100 cells across 
three experiments. Error bars represent the mean + s.e.m. and statistical 
significance was calculated using a one-tailed paired t-test. j, A model for parkin 
activation. PINK] activates latent parkin via two phosphorylation steps at 
Ser 65 in both the parkin UBL domain and ubiquitin, which de-repress the 
RINGO domain and relieve the catalytic cysteine (Cys 431). All experiments 
were repeated twice with similar results. 
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of conjugation by E2. We then examined whether phosphomimetic 
and phosphorylated ubiquitin accelerates discharge of UBCH7~un- 
modified ubiquitin (UBCH7 harbouring thioester-linked unmodified 
ubiquitin) in the presence of parkin in vitro (note that the reaction does 
not contain ATP, thus free phosphorylated ubiquitin cannot enter into 
the conjugation pathway). Both phosphomimetic and phosphorylated 
ubiquitin accelerates discharge of ubiquitin from UBCH7~ubiquitin 
depending on the presence of catalytic cysteine (Cys 431) of parkin, 
revealing that the activation mechanism is strictly allosteric (Fig. 4g 
and Extended Data Fig. 7b). We next examined whether phosphorylated 
ubiquitin interacts physically with parkin. Using the FluoPPI (fluorescent- 
based technology detecting protein-protein interactions) technique, in 
which protein-protein interactions are detectable as foci formation in 
living cells, we demonstrated a clear interaction between parkin and 
ubiquitin in cells. Importantly, parkin with an intact UBL domain failed 
to interact with phosphomimetic ubiquitin, whereas parkin with a phos- 
phomimetic (Ser65Glu) UBL mutation was able to interact (Figs. 4h, i). 
This result suggests that the interaction between parkin and phosphor- 
ylated ubiquitin competes with the intact parkin UBL domain, whereas 
phosphorylation of the UBL domain releases this inhibitory effect. Recently, 
the structure of parkin””**** revealed that the RINGO domain occludes 
the catalytic cysteine (Cys 431). To gain further insight into the mech- 
anism, we used computational modelling to generate two structural 
models that support binding of phosphorylated ubiquitin or phosphor- 
ylated UBL domain facilitates accessibility to Cys 431 (Extended Data 
Fig. 8). Because phosphorylation at both sites is required for parkin acti- 
vation (Fig. 3), either of these conformations might be an intermediate to 
the other. Based on our combined results and the autoinhibitory effect of 
the UBL domain’’, we speculate that phosphorylation of both ubiquitin 
and the UBL domain unlock repression of the catalytic cysteine by the 
RINGO domain, thereby converting parkin to its fully-active form (Fig. 4)). 

In the present study, we demonstrate—to our knowledge, for the 
first time—that parkin activation in response to mitochondrial damage 
proceeds following ubiquitin phosphorylation by PINK]. The E3 activity 
of parkin is usually repressed via autoinhibition*'°**”*”” but is specifically 
liberated by PINK] in a unique two-step phosphorylation-dependent 
manner: one step is the phosphorylation of the UBL domain of parkin’*”, 
and the other step is, unexpectedly, phosphorylation of ubiquitin (deter- 
mined in this work). As mitochondrial dysfunction has long been asso- 
ciated with sporadic Parkinson’s disease”, a complete understanding 
of the molecular mechanism underlying parkin activation is expected 
to clarify the pathogenesis of not only hereditary but also sporadic forms 
of Parkinson’s disease. Moreover, although thousands of papers study- 
ing ubiquitin have been published, this work is the first, to our know- 
ledge, to reveal ubiquitin phosphorylation and its significance. 


METHODS SUMMARY 


Immunoprecipitation of PINK] (ref. 21), Phos-tag PAGE’*”®, in vitro ubiquitina- 
tion of GFP-parkin on depolarized mitochondria'*”, charging and discharging 
assay’, and mass spectrometric analyses'*”°”° were performed as reported previ- 
ously. Detailed procedures are available in the Methods (online). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Cells, plasmids and transfections. HeLa cells and MEFs were cultured at 32- 
37 °C with 5% CO, in Dulbecco’s modified Eagle’s medium (DMEM, Sigma) con- 
taining 1X nonessential amino acids (Gibco), 1X sodium pyruvate (Gibco), and 
10% fetal bovine serum (Gibco). PINK1”'~ MEFs stably expressing wild-type or 
mutant PINK1 were established as reported previously*. To generate HeLa cells 
stably expressing PINK1-3 Flag, HeLa cells transiently expressing mCAT1 (mur- 
ine cationic amino acid transporter-1) were infected with recombinant retrovir- 
uses harbouring PINK1-3 Flag. Recombinant retrovirus was made using PLAT-E 
cells as reported previously’’. Plasmids for expressing wild-type or various mutants 
of parkin and ubiquitin were as described previously*’* or were newly constructed 
by conventional methods. Plasmid transfections were performed using the trans- 
fection reagent FuGene6 (Roche) for HeLa cells and polyethylenimine (Polyscience) 
for PINK1 '~ MEFs. To depolarize the mitochondria, cells were treated with 15- 
30 1M CCCP (Wako) for 2-3 h, except in Fig. 3g, in which cells were treated with CCCP 
for 10-30 min. siRNA targeting PINK] (J-004030-0008-0002, GGACGCUGUUC 
CUCGUUAU) or non-targeting control (D-001810-01-05, mixture of UGGUUU 
ACAUGUCGACUAA, UGGUUUACAUGUUGUGUGA, UGGUUUACAUGU 
UUUCUGA and UGGUUUACAUGUUUUCCUA) were purchased from Dhar- 
macon (Thermo Fisher Scientific). 

Phos-tag assay. To detect phosphorylated proteins using SDS-PAGE, 12.5-15% 
polyacrylamide gels containing 50 1M Phos-tag acrylamide (Wako chemicals) and 
100 uM MnCh were used (refs 15, 20). After electrophoresis, Phos-tag acrylamide 
gels were washed with gentle shaking in transfer buffer containing 0.01% SDS and 
1 mM EDTA for 10 min and then incubated in transfer buffer containing 0.01% 
SDS without EDTA for 10 min according to the manufacturer's protocol. Proteins 
were transferred to polyvinylidene difluoride (PVDF) membranes and studied by 
conventional immunoblot analysis as outlined below. 

Preparation of recombinant ubiquitin proteins. For purification of recombin- 
ant Hisg-ubiquitin, E. coli Rosseta2 (DE3) (Novagen) bacterial cells transformed 
with plasmids encoding wild-type or mutant Hisg-ubiquitin were pre-cultured over- 
night at 37 °C in 20 ml of lysogeny broth (LB) medium supplemented with 100 jg ml~* 
ampicillin and 24 yg ml” ' chloramphenicol, and then transferred to 200 ml of fresh 
medium. After incubation for 2 h at 37 °C, isopropyl-B-b-thiogalactoside (IPTG) 
was added at a final concentration of 1 mM, and E. coli were further cultured for 6 h 
at 37 °C. The cells were collected, suspended in 40 ml of 20 mM Tris-HCl, pH 7.5, 
and lysed by sonication. After centrifugation at 8,000 r.p.m. for 10 min, the super- 
natant was recovered and purified by conventional methods, dialysed against buffer 
A (50 mM Tris-HCl, pH 7.5, 100 mM NaCl, and 10% glycerol) and stored at —80 °C. 
Non-tagged ubiquitin, HA-ubiquitin, Hisg-ubiquitin harbouring C-terminal Gly75Ala/ 
Gly76Ala mutation, and SUMO-1 were purchased from Boston Biochem. 
Immunoblot analysis and immunofluorescence. To observe autoubiquitination 
of GFP-parkin (an indicator for the re-establishment of the latent E3 activity of 
parkin) in immunoblot analyses, lysates of HeLa cells or MEFs were collected in 
TNE-N* buffer (150 mM NaCl, 20 mM Tris-HCl, pH 8.0, 1 mM EDTA and 1% 
NP-40) in the presence of 10 mM N-ethyl maleimide (NEM), which protects ubi- 
quitinated proteins from deubiquitinase activity*®. For phosphorylation immuno- 
blot analyses, lysates from MEFs and HeLa cells were collected in the presence of 
PhosSTOP (Roche) to inhibit phosphatase activity. To detect various proteins via 
immunoblot analyses, the anti-ubiquitin antibody P4D1 (Santa Cruz, 1:1,000), Z0458 
(DAKO, 1:500), anti-SUMO-1 (GMP-1) antibody 21C7 (ZYMED, 1:500), anti- 
parkin antibody PRK8 (Sigma, 1:2,000), CS2132 (MBL, 1:1,000), anti-Flag antibody 
2HB8 (Transgenic, 1:500), anti-Mfn2 antibody ab56889 (Abcam, 1:500), anti-PINK1 
antibody BC100-494 (Novus, 1:1,000), anti-GFP antibody ab6556 (Abcam, 1:500), 
anti-VDACI antibody ab2 (Calbiochem, 1:1,000) and anti-FoF1 antibody (gift from 
T. Ueno, 1:1,000) were used. 

To investigate the requirement of diglycine motif of phosphomimetic ubiquitin, 
we replaced the C-terminal diglycine motif with either Val or Ala (Gly75Val/Gly76Val 
and Gly75Ala/Gly76Ala) and coexpressed with GFP-parkin(Ser65Glu) or GFP- 
parkin(Trp403Ala). Note that autoubiquitination bands of GFP-parkin are shifted 
slightly upwards in the lane by conjugation of exogenous Flag-ubiquitin(Ser65Asp) 
when Flag-ubiquitin(Ser65Asp) and GFP-parkin were co-expressed. In contrast, 
autoubiquitination bands of GFP-parkin lie further down the lane when it is co- 
expressed with Flag-tagged phosphomimetic ubiquitin lacking a C-terminal digly- 
cine motif; that is, only endogenous ubiquitin (not Flag-tagged phosphomimetic 
ubiquitin) can conjugate to GFP-parkin. 

For immunofluorescence experiments, HeLa cells were fixed with 4% parafor- 
maldehyde, permeabilized with 50 tg ml’ digitonin, and stained with a primary 
antibody (anti-GFP antibody ab6556 (Abcam, 1:500), anti-Flag antibody 2H8 (Trans- 
genic, 1:500), anti-Tom20 antibody FL-145 (Santa Crus Biotech., 1:3,000)) and a 
1:2,000 dilution of the secondary antibody (Alexa Fluor 488- or 568-conjugated 
anti-mouse or -rabbit IgG antibody, (Invitrogen)). Cells were imaged using a laser- 
scanning microscope (LSM510; Carl Zeiss) and image brightness was adjusted in 
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Photoshop (Adobe). For statistical analysis, subcellular localization of parkin and 
ubiquitin were analysed in 100 cells across three experiments, and statistical sig- 
nificance was calculated using a one-tailed Student’s t-test. When mitochondrial 
localization of phosphorylation-deficient ubiquitin(Ser65Ala) was examined by 
immunofluorescence, HeLa cells were treated with CCCP for 3 h to reach a plateau 
because overproduction of Ser65Ala delayed parkin activation. 

Cell-free reconstitution of GFP-parkin autoubiquitination. HeLa cells expres- 
sing exogenous GFP-parkin were suspended in cell-free assay buffer (20 mM HEPES- 
KOH (pH7.5), 220 mM sorbitol, 10 mM KAc, 70 mM sucrose) supplemented with 
a protease inhibitor cocktail minus EDTA (Roche). Cells were disrupted by passag- 
ing 30 times through a 25-gauge needle, and cell homogenates were centrifuged at 
800g for 10 min at 4 °C to obtain a post-nuclear supernatant. Cytosolic fractions 
were collected by further centrifugation at 20,400g for 10 min at 4 °C. For isolation 
of mitochondria, HeLa cells or MEFs expressing PINK1-3 Flag were treated with 
15-30 1M CCCP for 3 h followed by homogenization in the aforementioned cell- 
free assay buffer. Post-nuclear supernatants were then obtained by centrifugation 
as described above, and mitochondria were pelleted by further centrifugation at 
10,000g for 20 min at 4 °C. To perform the cell-free ubiquitination assay, HeLa 
cytosolic fractions containing exogenous GFP-parkin were incubated with iso- 
lated mitochondria supplemented with 5 mM MgCl, 5 mM ATP, 2mM DTT, and 
1% glycerol at 30 °C for 2 h (refs 15, 23). To demonstrate that phosphorylated 
ubiquitin activates parkin, ubiquitin was phosphorylated beforehand using iso- 
lated mitochondria, subjected to ultracentrifugation at 20,400g for 10 min at 4 °C 
to deplete mitochondria, and treated at 90 °C for 10 min. HeLa cytosolic fractions 
containing exogenous GFP-parkin were incubated with non-phosphorylated or 
phosphorylated ubiquitin (final concentration is 50 jg ml ') supplemented with 
5 mM MgCh, 5 mM ATP, 2 mM DTT, and 1% glycerol at 30 °C for 2 h. 
Immunoprecipitation of PINK1. Immunoprecipitation of PINK1 was performed 
as reported previously (ref. 21), but with some modifications. For immunopreci- 
pitation experiments, mitochondria from HeLa cells stably expressing PINK1- 
3x Flag were collected as described above, re-suspended in cell-free assay buffer 
(20 mM HEPES-KOH (pH 7.5), 220 mM sorbitol, 10 mM KAc, 70 mM sucrose), 
solubilized with 10 mg ml‘ digitonin (Wako) for 15 min at 4 °C, and reacted with 
agarose (Protein G Sepharose 4 Fast Flow, GE Life Sciences) conjugated with anti- 
Flag antibody 2H8 for 1 hat 4 °C. The resulting immunoprecipitates were washed 
repeatedly with the same buffer and collected by centrifugation. Its purity was 
confirmed by immunoblot analysis with antibodies against other mitochondrial 
membrane proteins such as VDAC, mitofusin-2 and FoF1 ATPase. 

In vitro phosphorylation of ubiquitin. To phosphorylate ubiquitin with depo- 
larized mitochondria in vitro, CCCP-treated mitochondria were collected as described 
above and then incubated with recombinant Hisg-ubiquitin (final concentration 
of 40 ng ul~*) in cell-free assay buffer (20 mM HEPES-KOH (pH 7.5), 220 mM 
sorbitol, 10 mM KAc, 70 mM sucrose) supplemented with 5 mM MgCl, 5 mM 
ATP, 2 mM DTT and 1% glycerol at 30 °C for 1 h. The supernatants were subse- 
quently subjected to centrifugation at 16,000g for 10 min to deplete mitochondria. 
To phosphorylate ubiquitin with immunoprecipitated PINK] in vitro, immuno- 
precipitated PINK1 was added to the aforementioned assay instead of depolarized 
mitochondria. For the in vitro kinase assay using [y-*’P]ATP, immunoprecipi- 
tated PINK1-3 Flag was incubated with HA-ubiquitin (10 1g) or Hisg-ubiquitin 
(wild-type or Ser65Ala; 30 pg) and 100 11M [y-32P]ATP (5 Ci) in 30 pl kinase 
buffer (20 mM Tris-HCl, pH7.5, 5 mM MgCl, and 1 mM DTT) for 30 min at 30 °C. 
The reaction was stopped by adding Laemmli’s sample buffer and boiling. One- 
third of the sample was subjected to 17% SDS-PAGE and CBB staining. Phosphor- 
ylated proteins were then visualized by autoradiography. 

LC-MS/MS analysis of ubiquitin. Mass spectrometric analyses were performed 
as reported previously (refs 15, 20, 30), but with some modification. To identify 
ubiquitin phosphorylation sites, recombinant ubiquitin was reacted with isolated 
mitochondria from CCCP-treated and untreated cells. Recombinant ubiquitin 
was then subjected to SDS-PAGE, stained with CBB, and in-gel trypsin digestion 
was carried out as reported previously”°. Gels were extensively washed with Milli- 
Q water (Millipore), the bands of interest were excised, cut into 1-mm” pieces, and 
destained with 1 ml of 50 mM ammonium bicarbonate (AMBC) buffer containing 
50% acetonitrile (ACN) with agitation for 1 h. A final 100% ACN wash was per- 
formed to ensure complete gel dehydration. Trypsinization solution (20 ng pl‘) 
was prepared by diluting modified sequencing grade trypsin (Promega) with 50 mM 
AMBC buffer, pH8.0, containing 5% ACN. The trypsin solution was added to the 
gel pieces and incubated overnight at 37 °C. Digests were quenched and extracted 
by addition of 50 pl of 50% ACN containing 0.1% trifluoroacetic acid (TFA) mix- 
ture for 1 h by shaking. The digested peptides were recovered into fresh Protein 
LoBind tubes and an additional extraction was performed with 70% ACN contain- 
ing 0.1% TFA mixture for 30 min. The extracted peptides were concentrated to 
20 ul by speed-vac. The concentrated peptides were prepared in 0.1% TFA. The resul- 
tant peptides were analyzed on a nanoflow UHPLC instrument (Easy nLC 1000, 
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Q-Exactive MS and nanoelectrospray ion source (Thermo Fisher Scientific)) with 
the raw data processed using Xcalibur (Thermo Fisher Scientific). MS spectra were 
analyzed using Protein Discoverer software version 1.3 (Thermo Fisher Scientific). 
The fragmentation spectra were searched against the UniProt database with the 
MASCOT search engine. 

To test the dependency on endogenous PINK1, HeLa cells grown in light or 
heavy SILAC (stable isotope labeling using amino acids in cell culture) media were 
treated with non-target or PINK1 siRNA, respectively. Precursor (M + 3H)°* spec- 
trum of the phosphorylated Ser 65 peptide of each sample was detected by LC-MS/ 
MS analysis. 

To quantify the phosphorylation of endogenous ubiquitin following dissipation 
of A‘¥m, PINK1-expressing or intact HeLa cells were treated with CCCP. Whole- 
cell lysates were collected in TNE-N* lysis buffer, subjected to SDS-PAGE and 
stained with CBB. The low-molecular-weight fraction (3-14 k) of the gels (as shown 
in Fig. 1g) were excised and cut into 1-mm” pieces. In-gel digestion using trypsin 
and lysyl endopeptidase (Wako) was carried out as described above. The absolute 
level of phosphorylated ubiquitin in PINK1-expressing or intact HeLa cells was 
measured by parallel reaction monitoring (PRM), a MS/MS-based quantification 
method’. The extracted peptides were spiked with 10 fmol AQUA peptides as 
standards (ESTLHLVLR and EpSTLHLVLR, SIGMA) and analysed by Q Exactive 
in targeted MS/MS mode. Data were processed by PinPoint software version 1.3 
(Thermo Fisher Scientific). 

Protein-protein interaction assay. Physical interaction between phosphomimetic 
ubiquitin and phosphomimetic parkin was detected by FluoPPI (fluorescent-based 
technology detecting protein-protein interactions) system (Amalgaam), in which 
the protein-protein interaction is detectable as the formation of fluorescent foci in 
living cells. The C-terminal diglycine motif in ubiquitin was mutated to prevent 
generation of a conjugation-derived pseudo-positive signal. This ubiquitin mutant 
was sub-cloned into phAG-MCL plasmid to fuse with a hAG (homo tetramer Azami- 
Green fluorescent protein) tag. Cys 431 in parkin was likewise mutated to prevent 
an autoubiquitination-derived pseudo-positive signal. The mutant was sub-cloned 
into the pAsh-MCL plasmid to fuse with Ash (homo oligomerized protein assembly 
helper) tag. The interaction between the hAG-tagged protein and the Ash-tagged 
protein induces formation of green fluorescent foci in cells. Ash-tagged parkin 
(Cys431Ala) and hAG-tagged ubiquitin(Gly75Ala, Gly76Ala) were co-transfected 
in cells and intracellular foci were observed using confocal microscopy (LSM710, 
Zeiss). For statistical analysis, foci-containing cells were counted in 100 cells across 
three experiments, and statistical significance was calculated using one-tailed paired 
t-test. 

Charging and discharging assay of UBCH7. The UBCH7 charging assay was 
performed as follows (ref. 22). We first incubated 40 uM His-Ub (wild-type, Ser65Ala 
or Ser65Asp) with 2 uM GST-TcPINK1 (ref. 13) and 2 mM Mg-ATP for 30 min, 
and then incubated the products (20 1M His-Ub) with 4 1M UBCH7, 0.1 uM E1, 
2mM Mg-ATP for 20 min. The reaction was stopped with sample buffer, with or 
without 50 mM DTT. The samples were resolved on SDS-PAGE with or without 
25 tM Phos-tag-MnCl,, and gel was stained with Coomassie Blue. The results 
show clearly that UBCH7 can be charged with phosphorylated ubiquitin, as seen in 
the shift observed in UBCH7~pUb, UBCH7~phosphorylated ubiquitin or char- 
ging of Ser65Asp. UBCH7 discharging assay was performed as follows (ref. 22). 
First His-UBCH7 was charged with N-terminal Fluorescein-labelled ubiquitin 
(FluoUb), using El and Mg-ATP. His-UBCH7~FluoUb was purified by Ni-NTA 
agarose, and then His- UBCH7~FluoUb (1 11M) was mixed with wild-type or mutant 
GST-parkin (0.5 |tM) in the presence of 10 11M His-Ub wild-type, Ser65Asp or 
phosphorylated HisUb, generated in vitro using TcPINK1. Mixtures were incu- 
bated at 37 °C for 10 min, resolved on SDS-PAGE, and imaged using the Typhoon 
fluorescent scanner (GE). Fluorescence strength of His- UBCH7-FluoUb in three 
experiments were measured for accurate quantification, and were subjected to sta- 
tistical analysis. 

Yeast analysis. To confirm the dependency of parkin activation on ubiquitin phos- 
phorylation, we first sought to replace genomic ubiquitin with a phosphorylation- 
deficient mutant in mammalian cells. However, this type of experiment is challenging 


because ubiquitin is encoded at four loci as fourteen copies in the human genome, 
and synthesized as a fusion protein with essential ribosomal subunits. As an alter- 
native, we used a yeast system in which all the genomic-ubiquitin genes are disrupted, 
the fused ribosomal proteins (L40 and S31) are complemented, and ubiquitin is 
expressed from a plasmid”. Yeast culture media were prepared according to stan- 
dard protocols. The Y1168 and Y1169 yeast strains, which express only wild-type 
ubiquitin and ubiquitin (Ser65Ala) respectively, were created as follows. First, 
SUB328 (MATa lys2-801 leu2-3,2-112 ura3-52 his3-A200 trp1-1 ubil-A1::TRP1 
ubi2-A2::ura3 ubi3-Aub-2 ubi4-A2::LEU2 (pUB146) (pUB100)), which expresses 
ubiquitin from a galactose-inducible promoter in pUB146 (ref. 25), was transformed 
with pRS317-Ub and pRS317-Ub(Ser65Ala). pRS317-Ub and pRS317-Ub(Ser65Ala) 
are LYS2-marked plasmids expressing wild-type or Ser65Ala yeast ubiquitin from 
a constitutive GPD (glyceraldehyde-3-phosphate dehydrogenase) promoter. The 
URA3-marked pUB146 plasmid was eliminated by streaking on plates containing 
0.5 mg ml” 5-fluoroorotic acid (Wako). The genotype of each strain is as follows: 
Y1168 (MATa lys2-801 leu2-3,2-112 ura3-52 his3-A200 trp1-1 ubil-A1::TRP1 
ubi2-A2::ura3 ubi3-Aub-2 ubi4-A2::LEU2 (pRS317-Ub) (pUB100)) and Y1169 
(MATa lys2-801 leu2-3,2-112 ura3-52 his3-A200 trp1-1 ubil-Al::TRP1 ubi2-A2::ura3 
ubi3-Aub-2 ubi4-A2::LEU2 (pRS317-Ub(Ser65Ala)) (pUB100)). 

To express human parkin in yeast, we cloned the parkin gene into a pADGEW 

plasmid (multi, TRP1). For genomic integration of human parkin, we created a 
plasmid (pBS-hphMX4-parkin) with a cassette encoding the hygromycin resist- 
ance gene, hphMX4, and human parkin tagged at the N terminus with GFP anda 
3X FLAG epitope tag, and with parkin flanked by the ADH1 promoter and CYC1 
terminator. The gene cassette amplified from pBS-hphMX4-parkin was introduced 
into the Y1168 and Y1169 yeast strains described above, and hygromycin-resistant 
transformants were selected. To localize human PINK1 on the mitochondrial sur- 
face in yeast, we fused the first 40 N-terminal residues from a yeast mitochondrial 
outer membrane protein (Om45) to the N terminus of PINK] lacking the first 120 
N-terminal amino acids (OM-PINK1); note that full-length human PINK1 was 
unstable in yeast cells. The 40 N-terminal residues of Om45 encompass a trans- 
membrane segment that is known to function as a mitochondrial outer membrane- 
targeting signal. The DNA fragment encoding OM-PINK1 with a C-terminal 3x 
FLAG-tag was cloned into a URA3-CEN plasmid, p416GALI. The resulting plas- 
mid, p416GAL1-OM-PINKI-FLAG was introduced into the parkin-expressing 
yeast strains described above. As the expression of OM-PINK1 is under the control 
of the GAL1 promoter, the yeast cells were cultivated in galactose-containing media. 
Whole-cell lysates were subjected to immunoblot analysis. PINK] and parkin expres- 
sion were confirmed by immunoblot analysis with an anti-parkin antibody Prk8 
(Cell Signaling Technology) or an anti-PINK1 antibody D8G3 (Cell Signaling 
Technology). 
Computational modelling of fully activated parkin. The two proposed models 
for fully activated parkin with structural changes in the RING2 domain incorp- 
orating either the phosphorylated Ub] domain or phosphorylated ubiquitin were 
constructed using the protein docking procedure as follows. We first simulated 
docking of phosphorylated UBL (structural region from Met 1 to Arg71 from 
Protein Data Bank (PDB) entry 4k95) or ubiquitin (PDB entry lubg) to the struc- 
ture of inactivate parkin (PDB entry 4k95) in which the RING2 domain was trun- 
cated. The RING2 domain was then reassembled in complex with UBL or ubiquitin 
using protein docking and the full atomic models refined. We used ZDOCK version 
3.0.2 to generate 2000 candidate conformation for each protein docking step. 
During the protein docking procedure, Ser 65 was mutated to glutamate because 
the ZDOCK program only supports native amino acids. The best model in the first 
step was manually selected based on the information ofa putative phosphopeptide 
binding site’’ from the structure with high ZDOCK sores. In the second step, we 
manually selected the final model based on both the ZDOCK score and domain 
connectivity between REP and RING2 domains. Finally, the Ser65GIn mutation was 
replaced with phosphorylated Ser using the molecular builder program in Maestro 
(Schrodinger). Refinement of the final model was performed using the Protein 
Preparation Wizard workflow in Maestro (Schrédinger). 
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Extended Data Figure 1 | Ubiquitin Ser 65 is phosphorylated by depolarized 
mitochondria. a, Sequence comparison between ubiquitin and the parkin UBL 
domain. Grey asterisks, identical amino acid residues. b, When the same 
samples as Fig. 1b were subjected to non-Phos-tag PAGE, no retarded-mobility 
form of ubiquitin was observed. Black asterisk, cross-reacting band. 

c, Recombinant SUMO-1 was incubated with depolarized mitochondria 
prepared from PINK1-expressing HeLa cells in vitro. Red asterisk, 
phosphorylated ubiquitin as a positive control under the same experimental 
conditions. d, Mass spectrometric (MS) analysis of the tryptic phosphopeptide 
from ubiquitin obtained in Fig. 1b. e, f, When the same samples as in Fig. 1c 
(e) and Fig. 1d (f) were subjected to non Phos-tag PAGE, no retarded-mobility 
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ESTLHLVLR EpSTLHLVLR 


Middle fraction 


form of ubiquitin was observed. g, The portion of the gel corresponding to the 
low-molecular-weight fraction of whole-cell extracts used for MS analysis of 
Fig. le. h, Whole-cell lysates of PINK1-expressing HeLa cells + CCCP 
treatment were subjected to SDS-PAGE and the high- (>55 kDa), middle- 
(14-55 k) and low- (<14 k) molecular-weight fractions were collected for MS 
analysis of Extended Data Fig. li. i, Using AQUA peptides as standards, the 
absolute quantities of the ESTLHLVLR and EpSTLHLVLR peptides in the high, 
middle and total fractions were determined in three experiments. Error bars 
represent mean + s.e.m. and statistical significance was calculated using a 
one-tailed paired t-test. 
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Extended Data Figure 2 | PINK1 is essential for ubiquitin phosphorylation. 


a, Results of the control experiments for Fig. 2a. The same samples as Fig. 2a 
were subjected to non-Phos-tag PAGE to confirm that no retarded-mobility 
form of ubiquitin was observed. b, PINK1 was immunoprecipitated and its 
purity was confirmed by immunoblot analysis with antibodies against other 
mitochondrial membrane proteins. c, Results of the control experiments for 


Fig. 2b. The same samples as Fig. 2b were subjected to non Phos-tag PAGE 
to confirm that no retarded-mobility form of ubiquitin was observed. 

d, Immunoblot analysis to confirm PINK1 knockdown. HeLa cells grown in 
light or heavy SILAC media were treated with non-target (NT) or PINK1 
siRNA, respectively, and were subjected to immunoblot analysis with an 
anti-PINK1 antibody. 
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Extended Data Figure 3 | Mass-spectrometry-based absolute quantification 
of phosphorylated ubiquitin in intact HeLa cells. Whole-cell lysates of intact 
HeLa cells + CCCP treatment were subjected to SDS-PAGE, and the 

high- (>55 k), middle- (14-55 k), or low- (<14k) molecular-weight fractions 
were collected. The absolute quantities of the ESTLHLVLR and EpSTLHLVLR 
peptides in the high, middle and total fractions were then determined using 
AQUA peptides as standards in three experiments. Error bars represent mean 
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Unknown peptide EpSTLHLVLR 


+ s.e.m. and statistical significance was calculated using a one-tailed paired 
t-test. Because the signal intensity is low, the original data and chromatogram 
of EpSTLHLVLR peptide are also shown. The light signal (derived from 
endogenous EpSTLHLVLR peptide), which shows a similar ionized pattern 
and the identical elution time as the heavy signal (derived from internal AQUA 
EpSTLHLVLR control peptide), was observed only in CCCP-pretreated cells 
(red arrows). 
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Extended Data Figure 4 | Subcellular localization of phosphomimetic or b, Mitochondrial accumulation of phosphomimetic or phosphorylation- 


phosphorylation-deficient ubiquitin under various experimental 


deficient ubiquitin depending on parkin and CCCP treatment (3 h). 


conditions. a, Phosphomimetic ubiquitin(Ser65Asp) stimulates the E3 activity  c, Phosphomimetic ubiquitin(Ser65Asp) is conjugated to the parkin substrate 


of parkin in the absence of mitochondrial localization. Example figures mitofusin-2. Red bars indicate ubiquitination of mitofusin-2 with 
indicative of cytoplasmic localization of both phosphomimetic parkin and phosphomimetic Flag-ubiquitin (lane 2) or phosphorylation-deficient 
ubiquitin are shown. Mitochondrial localization was 0% of 100 cells in two Flag-ubiquitin (lane 4). 


independent experiments. Bars indicate 10 um unless otherwise specified. 
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Extended Data Figure 5 | PINK1 was inactivated following heat treatment. 
Autoubiquitination of GFP-parkin in cell extracts was reconstituted when 
incubated with depolarized mitochondria (lane 1), whereas heat pre-treatment 
of the mitochondria abolished autoubiquitination (lane 2). Because the 
parkin-activating potency of depolarized mitochondria was lost by heat 
treatment (at 90 °C for 10 min), mitochondrial proteins including PINK1 were 
likely inactivated by the aforementioned conditions. 
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Extended Data Figure 6 | Reconstitution of parkin activation by PINK1 in 
yeast cells. A plasmid encoding GFP-parkin or a pathogenic mutant 
GEFP-parkin(Thr240Arg) under the control of a constitutive promoter and a 
plasmid encoding OM-PINK1 (40 amino acids at the N terminus of yeast 
mitochondrial outer-membrane protein Om45 were fused to the N terminus of 
PINK1 lacking the genuine 120 amino acids at the N terminus) under the 
control of a galactose (Gal)-inducible promoter were co-transfected into yeast 


al 


cells. The yeast cell lysates were collected and subjected to immunoblot analysis 
with an anti-parkin or anti-PINK1 antibody. As is the case with mammalian 
cells, a putative autoubiquitination signal was observed with GFP-parkin in 
accordance with PINK1 mitochondrial accumulation (lanes 1-4), whereas the 
PINK1 kinase dead (KD) mutation (lanes 5-8) or pathogenic parkin 
Thr240Arg mutation (lanes 9-12) hindered it. 
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Extended Data Figure 7 | In vitro charging and discharging assay of Fluorescein-labelled ubiquitin, and then purified His- UBCH7 ~FluoUb (1 1M) 
UBCH7. a, Charging assay of UBCH7 using both phosphorylated and was incubated with wild-type or mutant GST—parkin (0.5 j1M) in the presence 
non-phosphorylated ubiquitin. Samples prepared as in Fig. 4f were resolvedon of 10 LM His-Ub wild-type, Ser65Asp or phospho-HisUb. Signal derived 
SDS-PAGE using standard (non-Phos-tag) gel. b, Discharging assay of from Fluorescein-labelled ubiquitin was imaged using the fluorescent scanner. 
UBCH7 using wild-type, phosphomimetic and phosphorylated ubiquitin Fluorescence strength of His-UBCH7~FluoUb bands was quantified in three 
in the presence of parkin. His- UBCH7 was charged with N-terminal experiments, and were subjected to statistical analysis as shown in Fig. 4g. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Modela 


Extended Data Figure 8 | Structural modelling of the fully activated parkin. phosphorylated UBL domain (Model b) or phosphorylated ubiquitin (Model c) 
Based on the structure of the inactive, autoinhibited parkin (Model a), two to the RINGO domain supports a structural change in the RING2 domain. It is 
possible models of fully activated parkin with higher accessibility to catalytic possible that Model b is an intermediate for Model c, or vice versa. 

cysteine (Cys 431 in the RING2 domain) were generated. Binding of the 
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Extended Data Table 1 | A list of the ubiquitin peptides used in a parallel reaction monitoring assay to quantify phosphorylated ubiquitin 


Peptide sequence 


Precursor m/z (charge state) 


Product ions for PRM 


ESTLHLVLR 


534.314 (+2) 


Ys", Ya", V5", Yo's V7" 


ESTLHLVL [Heavy] R 


537.823 (+2) 


ys", Ya", Y5"s Yes V7" 


EpSTLHLVLR 


574.297 (+2) 


Ys", Ya", Ys", Yo's V7" 


EpSTLHLVL [Heavy] R 


577.806 (+2) 


ys", Ya", Ys") Yo's V7" 


For each peptide, the optimal precursor ion and product ions are shown. 
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Oncogene-like induction of cellular invasion from 


centrosome amplification 


Susana A. Godinho'+, Remigio Picone’?, Mithila Burute**°, Regina Dagher’”, Ying Su°, Cheuk T. Leung’}, Kornelia Polyak®, 


Joan S. Brugge’, Manuel Théry** & David Pellman’” 


Centrosome amplification has long been recognized as a feature of 
human tumours; however, its role in tumorigenesis remains unclear’. 
Centrosome amplification is poorly tolerated by non-transformed 
cells and, in the absence of selection, extra centrosomes are sponta- 
neously lost”. Thus, the high frequency of centrosome amplification, 
particularly in more aggressive tumours’, raises the possibility that 
extra centrosomes could, in some contexts, confer advantageous 
characteristics that promote tumour progression. Using a three- 
dimensional model system and other approaches to culture human 
mammaty epithelial cells, we find that centrosome amplification trig- 
gers cell invasion. This invasive behaviour is similar to that induced 
by overexpression of the breast cancer oncogene ERBB2 (ref. 4) and 
indeed enhances invasiveness triggered by ERBB2. Our data indicate 
that, through increased centrosomal microtubule nucleation, centro- 
some amplification increases Racl activity, which disrupts normal 
cell-cell adhesion and promotes invasion. These findings demonstrate 
that centrosome amplification, a structural alteration of the cytos- 
keleton, can promote features of malignant transformation. 

The centrosome is the major microtubule-organizing centre in mam- 
malian cells and comprises of a pair of centrioles surrounded by the 
pericentriolar material’. Centrosome abnormalities, usually increased 
numbers, are common in human tumours’ and have been positively 
associated with advanced tumour grade and metastasis’, suggesting a 
possible role in tumour progression. This is somewhat surprising given 
the well-documented deleterious effects of centrosome amplification 
on cell proliferation®; such amplification can be lethal if it compromises 
the ability of cells to organize multiple centrosomes to generate pseudo- 
bipolar spindles”. These seemingly paradoxical observations suggest that 
centrosome amplification might enhance other aspects of tumorigenesis. 

We have developed orthogonal approaches to generate genetically 
comparable cells that do or do not carry extra centrosomes’. Here we adapt 
these methods to determine how centrosome amplification influences 
epithelial organoid integrity, making use of the well characterized three 
dimensional (3D) culture model for MCF10A cells, a non-transformed 
human mammary epithelial cell line. This model recapitulates many 
aspects of breast glandular architecture’. 

We engineered MCF10A cells to enable the inducible overexpression 
of Polo-like kinase 4 (PLK4), an essential regulator of centrosome dup- 
lication, whose overexpression induces supernumerary centrosomes*”. 
Asa negative control, we transiently overexpressed a truncated form of 
PLK4 (PLK4' °°) that retains kinase activity but does not induce cen- 
trosome amplification’’. As expected, transient induction of PLK4, but 
not of PLK4'~©8 Jed to centrosome amplification (Fig. 1aand Extended 
Data Fig. 1). Strikingly, centrosome amplification induced by PLK4 
resulted in the formation of invasive protrusions, cytoplasmatic exten- 
sions that invade the surrounding matrix (Fig. 1b and Extended Data 
Fig. 1f, g). Expression of centrinl-GFP to visualize the centrioles revealed 
that virtually all cells with invasive protrusions exhibited centrosome 


amplification (Fig. 1c). An independent approach, using an organotypic 
culture system to assay for fibroblast-led collective migration, confirmed 
that centrosome amplification promotes invasion, both of MCF10A cells 
and non-transformed keratinocytes (HaCaTs) (Fig. 1d and Extended 
Data Fig. 1h). 

Cytokinesis failure was induced in MCFIOA cells using dihydrocyto- 
chalasin B (DCB) to generate centrosome amplification without PLK4 
overexpression. Newly generated tetraploid cells, with doubled centro- 
some content, were isolated by fluorescence-activated cell sorting (FACS). 
A control population of tetraploid cells in which extra centrosomes were 
spontaneously lost were generated, as previously described? (evolved 
tetraploids, 4N.evo, Extended Data Fig. 2a—e). Tetraploid cells with extra 
centrosomes were invasive in 3D cultures, whereas 4N.evo cells were 
not (Fig. le). PLK4 overexpression in 4N.evo cells induced centrosome 
amplification accompanied by invasive protrusions, demonstrating that 
4N.evo cells still retained the ability to become invasive (Extended Data 
Fig. 2g, h). 

Invasive protrusions are accompanied by the degradation of laminin-V 
(Fig. 1f) and collagen-I (Extended Data Fig. 1i), contain actin and micro- 
tubules (Extended Data Fig. 3a) and are surrounded by the extracellular 
matrix component fibronectin (Extended Data Fig. 3b). Consistent with 
centrosome amplification promoting matrix degradation, the invasive 
phenotype was partially suppressed by inhibition of metalloproteinases 
using marimastat (Extended Data Fig. 3c). Live-cell imaging showed 
that protrusions are highly dynamic, constantly extending and retract- 
ing (Supplementary Videos 1 and 2), which may partially explain why 
only a fraction of acini with extra centrosomes exhibits invasive pro- 
trusions at a given time (Fig. 1c). The formation ofan initial protrusion 
provided a track for the collective migration of multiple cells out of the 
acinus and into the surrounding matrix (Extended Data Fig. 3d, e and 
Supplementary Video 3). This type of collective invasion resembles what 
has been observed in tumours in vivo. Indeed, many solid tumours typ- 
ically exhibit collective invasion, which often involves the degradation 
of the extracellular matrix". 

Invasion induced by centrosome amplification strongly resembled that 
induced by a bona fide breast cancer oncogene, ERBB2 (ref. 4) (Extended 
Data Fig. 4). As in ERBB2 tumours”, cells with extra centrosomes retained 
the expression of E-cadherin (Extended Data Fig. 3f), suggesting that 
mechanisms other than a classical epithelial-to-mesenchymal transi- 
tion account for the invasive phenotype. Importantly, when combined 
with ERBB2 overexpression, centrosome amplification enhanced the 
frequency of invasive acini (Fig. 1g). 

The induction of invasive protrusions by extra centrosomes could be 
an indirect consequence of aneuploidy that results from chromosome 
missegregation””’. To directly address this possibility, we depleted MCAK 
(also known as KIF2C), a kinesin important for chromosome segregation 
during mitosis’*, to induce a comparable degree of aneuploidy before 
(48 h after PLK4 induction) and after 4 days in 3D culture (Fig. 2a, b). 


1Howard Hughes Medical Institute, Department of Pediatric Oncology, Dana-Farber Cancer Institute and Pediatric Hematology/Oncology, Children’s Hospital, Boston, Massachusetts 02115, USA. 
?Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA. 3Institut de Recherche en Technologie et Science pour le Vivant, UMR5168 CEA/UJF/INRA/CNRS, Grenoble, 
France. *Hépital Saint Louis, Institut Universitaire d’Hematologie, U1160 INSERM/AP-HP/Université Paris Diderot, Paris 75010, France. °CYTOO SA, Grenoble 38054, France. °Department of Medical 
Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA. +Present addresses: Barts Cancer Institute, Queen Mary University of London, Charterhouse Square, 
London EC1M 6BQ, UK (S.A.G.); Department of Pharmacology, University of Minnesota, Minneapolis, Minnesota 55455, USA (C.T.L). 
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Figure 1 | Invasive behaviour of epithelial cells triggered by centrosome 
amplification. a, Cells stained for microtubules («-tubulin, red), centrioles 
(centrin2, green) and DNA (Hoechst, blue). Scale bar, 10 um. Histogram shows 
fraction of cells with centrosome amplification. Error bars represent 

mean ~ s.e. from 3 independent experiments. b, Fraction of invasive acini 
in 3D cultures and representative images of normal acinus and an acinus 
with invasive protrusions. Scale bar, 10 tm. Error bars represent mean + s.e. 
from 4 independent experiments. c, Cells stained for F-actin (red), centrioles 
(centrinl1-GFP, green, inset white) and DNA (blue). Scale bar, 10 jum. 
Histogram shows fraction of acini with centrosome amplification after PLK4 
overexpression. Error bars represent mean + s.e. from 3 independent 
experiments. d, Scheme of the organotypic culture model used to assess 
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invasion (top). Images show haematoxylin and eosin (H&E) staining of 
sections of MCF10A cells plated on the organotypic model, with and without 
fibroblasts (black arrows indicate highly invasive areas). Percentage of invasion 
(mean = s.d.): —Dox = 11.7 + 0.83; +Dox = 26.1 + 6.5. Scale bar, 100 im. 
e, Fraction of invasive acini in tetraploids. Error bars represent mean = s.e. 
from 3 independent experiments. f, Acini stained for laminin-V (green), 
F-actin (red) and DNA (blue). White arrow indicates laminin-V degradation. 
Scale bar, 10 jum. g, Fraction of invasive acini in cells that overexpress ERBB2, 
with or without centrosome amplification. Error bars represent mean + s.e. 
from 3 independent experiments. All P values were derived from unpaired 
two-tailed Student’s t-test (**P < 0.005; ***P < 0.0005). OE, overexpression. 
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Figure 2 | The induction of aneuploidy does not generate invasive acini. 
a, Western blot showing depletion of MCAK by siRNA (48h). The asterisk 
marks a non-specific band. b, Quantification of chromosome number, shown as 
the percentage deviation from the mode, in cells before adding to 3D cultures 
(48h) and after 3D cultures (~100 spreads were scored per condition). 
Significance was determined by chi-squared testing to compare the frequencies 
of nominal variables (*P < 0.05). siLuc, si luciferase control. c, Fraction of cells 
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with centrosome amplification. Error bars represent mean + s.e. from 3 
independent experiments. d, Fraction of invasive acini. Error bars represent 
mean = s.e. from 3 independent experiments. P value derived from unpaired 
two-tailed Student’s t-test (**P < 0.005, ***P < 0.0005; NS, not significant). 
e, SNP analysis of cells with extra centrosomes and depleted of MCAK. Shown 
are log, copy number ratios of the indicated samples relative to their controls. 
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Cells depleted of MCAK neither increased centrosome number nor exhib- 
ited a significant increase in invasive protrusions (Fig. 2c, d). Similarly, 
aneuploidy, generated by inhibition of the spindle assembly checkpoint 
kinase MPS1 (using the inhibitor reversine)’” also failed to induce inva- 
sion (Extended Data Fig. 5a-c). In addition, although 4N.evo cells exhibit 
substantial aneuploidy, they did not form invasive acini (Fig. le and 
Extended Data Fig. 2f). Finally, single-nucleotide polymorphism ana- 
lysis (SNP-arrays) demonstrated that neither cells with extra centro- 
somes nor depletion of MCAK accumulate a recurrent aneuploidy after 
4 days in 3D culture (Fig. 2e and Extended Data Fig. 5d). Thus, aneu- 
ploidy per se is not responsible for the invasive behaviour. 
Centrosome amplification also leads to altered cilia signalling’® and 
to increased levels of p53 (ref. 17). However, we found that neither the 
parental MCF10A cells nor the derivatives with extra centrosomes formed 
detectable cilia after 4 days in 3D culture. Moreover, depletion of p53 
did not alter centrosome amplification mediated invasion (Extended 
Data Fig. 6a—d). In a transplant model using asymmetrically dividing 
neuroblasts from Drosophila melanogaster, centrosome amplification 
induces tumours that, interestingly, are capable of metastasis'*. Centro- 
some amplification in this system disrupts asymmetric cell division, result- 
ing in stem cell expansion, potentially contributing to tumorigenesis. 
Because MCFIOA cells do not undergo asymmetric cell division, this 
mechanism does not apply to our results. In the Drosophila neuroblasts, 
centrosome amplification may also disrupt cell polarization, which can 
trigger tumorigenesis in many systems. Although we cannot exclude 
effects on cell polarization in our system, we do note that MCF10A cells 
cannot form tight junctions and do not exhibit polarization of the apical 
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Par3—Par6-aPKC complex’’. Moreover, centrosome amplification does 
not impair the ability of MCF10A cells to asymmetrically position cen- 
trosomes, the main detectable polarization in these cells (Extended Data 
Fig. 6e, f). 

Insight into why cells with extra centrosomes are invasive in 3D cul- 
tures first came from observing the adhesive properties of single cells 
after cell division. As expected because they are epithelial cells, following 
mitosis, MCF10As formed cell-cell contacts, and remained as apposed 
cell pairs. By contrast, cells with extra centrosomes ‘scattered’, result- 
ing in a high fraction of individual cells (Extended Data Fig. 7a, b and 
Supplementary Videos 4 and 5), a characteristic associated with loss of 
cell-cell adhesion. Furthermore, live cell imaging in cells with a fluores- 
cent membrane marker revealed that cell-cell contacts are not stable and 
often overlap in cells with extra centrosomes (Fig. 3a, Supplementary 
Videos 6 and 7). As a direct measure of cell-cell junction integrity, we 
plated cells onto fibronectin micropatterns that were specifically designed 
to promote the formation of adherens junctions at a stereotypical posi- 
tion between the two cells (Fig. 3b)”°. Consistent with the cell scattering 
effect, centrosome amplification produced marked defects in cell-cell 
junction positioning and size (Fig. 3c, d). This effect of centrosome amp- 
lification is similar to that described for loss of p120 catenin, whose 
knockdown weakens cell-cell contacts”, although centrosome ampli- 
fication does not affect p120 levels (Extended Data Fig. 7c). 

These phenotypic characteristics induced by centrosome amplifica- 
tion are similar to what has previously been observed upon activation 
of Racl (refs 21, 22), a small GTPase strongly associated with oncogenic 
signalling and with the induction of invasiveness and metastasis”**. 


Figure 3 | Centrosome amplification disrupts 
normal cell-cell adhesion because of Racl 
activation. a, Kymograph analysis of cell-cell 
adhesion in live cells visualized with a plasma 
membrane marker (Raichu-Racl, CFP 
fluorescence). Red arrows mark areas of increased 
dynamics of cell-cell contacts. b, Scheme of the 
micro-pattern used. c, Images of cells (after 
mitosis) on micro-patterns labelled for E-cadherin 
(GFP; green), F-actin (red), DNA (blue) and 
fibronectin micro-pattern (green). Normal sized 
junction with a normal position (narrow angle) 
(top); abnormal junction position (middle); 
smaller junction size (bottom). Scale bar, 10 jim. 
d, Distribution of the cell-cell junctions angles, as a 
measure of junction positioning, (left) and size 
(right). For -Dox, n = 143; +Dox, n = 216. Error 
bars represent mean + s.e. e, Representative 
western blot from a pull-down experiment to detect 
GTP-bound Racl. Histogram shows quantification 
from 3 independent pull-down experiments. Error 
bars represent mean + s.e. f, Top shows examples 
of FRET ratiometric images; colour-coded scale 
represents the level of Racl activation. Scale bar, 
10 um. Bottom, levels of active Racl measured by 
FRET. For —Dox, n = 50; + Dox, n = 38. Error bars 
represent mean + s.e. g, Inhibition of Racl with 
NSC23766 in cells plated on micro-patterns. 

For —Dox, n = 80; + Dox n = 119; +Dox + 
NSC23766, n = 162. h, Fraction of invasive acini 
after treatment with NSC23766 and images of 
control and NSC23766-treated acini. Scale bar, 

20 um. Error bars represent mean = s.e. from 3 
independent experiments. All P values were 
derived from unpaired two-tailed Student’s t-test 
(***P << 0,0005). 
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This motivated the hypothesis that extra centrosomes might promote 
invasive-like effects through inappropriate activation of Racl. Indeed, 
we found that centrosome amplification induced a consistent ~1.5-fold 
Racl activation using a biochemical pull-down assay to measure GTP- 
bound Racl in multiple cell lines. Maximal Racl activity induced by 
EGF in MCF10A cells is ~2-fold (Fig. 3e and Extended Data Fig. 7d, e). 
This was confirmed by monitoring Rac] activation in single cells using the 
Raichu-Racl fluorescence resonance energy transfer (FRET) biosensor” 
(Fig. 3f and Extended Data Fig. 7f, g). Consistent with the commonly 
reported antagonism between Racl and RhoA, we found that cells with 
extra centrosomes have decreased active RhoA (Extended Data Fig. 7h). 
Thus, centrosome amplification in MCF1O0A cells activates Racl. 
Small molecule Rac] inhibitor (NSC23766) inhibited Racl activation 
and partially rescued defects in cell-cell adhesion, suggesting that Racl 
activation is responsible for the cell-cell adhesion defect in cells with cen- 
trosome amplification, (Fig. 3g and Extended Data Fig. 7i). Similar results 
were obtained in tetraploid MCF10A cells (Extended Data Fig. 8a, b). 
Furthermore, treatment with CK-666, an inhibitor of the Arp2/3 complex, 
an actin nucleator and important downstream target of Racl (ref. 23), 
also partially rescued the defects in cell-cell adhesion induced by cen- 
trosome amplification (Extended Data Fig. 8c—e). These findings demon- 
strate that the cell-cell adhesion defects in cells with extra centrosomes 
are, to a substantial degree, caused by increased Arp2/3-dependent actin 
polymerization that occurs downstream of Rac] signalling. In addition, 
we found that Racl inhibition blocked the formation of invasive acini 
without impairing the ability of cells to form normal acini (Fig. 3h). 
Previous work has established that microtubule polymerization after 
nocodazole washout induces Rac] activation. This activation appears to 
require dynamic microtubules because it is suppressed by the microtubule- 
stabilizing agent, paclitaxel”*. We considered the possibility that centro- 
some amplification induces Rac] activation through effects on centrosomal 


microtubule nucleation. As expected”, MCF10A cells with extra cen- 
trosomes display elevated levels of centrosomal y-tubulin (Extended Data 
Fig. 9a-c) and an increased capacity for microtubule-nucleation (Fig. 4a, 
note that amplified centrosomes are almost always clustered in inter- 
phase). Moreover, paclitaxel blocked the activation of Racl in cells with 
extra centrosomes (Fig. 4b), indicating a similar requirement for dynamic 
microtubules”’. These results were independently confirmed when Racl 
activity was monitored by FRET (Fig. 4c). Finally, Racl activation mea- 
sured by FRET was also observed in cells with extra centrosomes deprived 
of EGF (Extended Data Fig. 9d), indicating that this Racl activation is 
independent of any effects on growth factor signalling. Furthermore, 
induction of centrosome amplification (at 48 h) does not alter the cell 
cycle profile (Extended Data Fig. 9e) arguing against cell cycle effects 
as the cause for Racl activation in cells with extra centrosomes. 

We next examined the impact of increased centrosomal microtubule 
nucleation on cell-cell adhesion and the development of invasive struc- 
tures. To do so, centrosomal microtubule nucleation was moderately 
decreased by RNAi-mediated knockdown of CEP192, which encodes a 
centrosomal protein required for interphase recruitment of y-tubulin 
to the centrosomes”. As described’*, this protocol decreases centroso- 
mal y-tubulin without affecting centrosome number (Extended Data 
Fig. 10a, b, d, e). Consistent with our hypothesis, the depletion of CEP192 
inhibited Racl activation and restored normal cell-cell adhesion among 
cells with centrosome amplification (Fig. 4d—f). Most notably, partial 
depletion of CEP192 after short hairpin RNA (shRNA) treatment fully 
suppressed the invasive phenotype in cells with extra centrosomes with- 
out compromising cell viability or centrosome amplification, even 
after 4 days in 3D cultures (Fig. 4g and Extended Data Fig. 10c, f-h). 
These results indicate that increased centrosomal microtubule nucle- 
ation in cells with extra centrosomes triggers invasion. Although it remains 
unclear how dynamic microtubules activate Racl (ref. 29), our data 
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indicates that Racl activation downstream of microtubule nucleation 
plays a central role in this phenomenon. 

Deregulated Rac] activity has been implicated in the pathogenesis of 
many tumour types and is known to drive tumour invasion and metastasis™. 
Previous studies have demonstrated that Racl activation in tumours 
can be regulated by many mechanisms™. Our data indicate that centro- 
some amplification is probably another common mechanism for Racl 
activation during tumorigenesis, and that centrosome amplification may 
augment other oncogenic signals. In addition to Rac] activation, additional 
mechanisms probably contribute to the effects of centrosome amp- 
lification. Gene expression analyses of 3D cultures showed that TGF-B 
pathway, involved in metastasis formation and associated with tumour 
aggressiveness”, is strongly upregulated in cells with extra centrosomes 
(S.A.G. and D.P., unpublished data). 

Many studies have noted a positive correlation between centrosome 
amplification and advanced-stage tumours, recurrence and poor survival’, 
yet the mechanistic basis for this correlation has remained unclear. Here 
we provide evidence that centrosome amplification can mimic and accen- 
tuate the effects of oncogenes in triggering cellular invasion. These find- 
ings illustrate the integral relationship between cellular signalling and 
the cytoskeleton, underscoring the importance of this relationship to 
tumour progression. 


METHODS SUMMARY 


Human mammary epithelial MCF10A cells were maintained at 37 °C with 5% CO, 
atmosphere. PLK4 overexpression to induce centrosome amplification was induced 
by treatment with 2 jg ml’ of doxycycline (48 h). Alternatively, centrosome amp- 
lification was induced by inhibition of cytokinesis with 4 1M of DCB (18h) and 
tetraploid cells were isolated by FACS, as previously described’. A more detailed 
description of methods used can be found in the Methods section. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Cell culture. Human mammary epithelial MCF10A cells were maintained at 
37°C with 5% CO, atmosphere and cultured as previously described*'. Briefly, 
MCFIOA cells were grown in DMEM/F12 (Invitrogen) supplemented with 5% 
donor horse serum (Sigma), 20ngml’ epidermal growth factor (EGF; Sigma), 
10 yg ml’ insulin (Invitrogen), 100 pg ml * hydrocortisone (Sigma), 1 ng ml? 
cholera toxin (Sigma), 100 U ml! penicillin and streptomycin (Invitrogen). The 
MCFIOA cell line overexpressing the human ERBB2 gene (MCF10A.ErbB2 or 
MCF10A.NeuN) was previously characterized****. For 3D cultures, cells were grown 
in the same medium with reduced horse serum (2%) and EGF (5 ng ml ~ 1). To assay 
invasion in 3D cultures, cells were grown in a mix of Matrigel:collagen-I, as previ- 
ously described**. The addition of collagen-I to Matrigel facilitates invasion by increa- 
sing matrix stiffness*’. We used growth factor-reduced Matrigel (BD Biosciences) lots 
with protein concentrations between 9 and 11 mg ml” '. Collagen-I (BD Biosciences) 
was used at 1.6 mg ml *. Cells were grown for 4 days in 3D cultures before quan- 
tification of invasion. Between 200-300 acini were scored per condition for each 
experiment. To assess collagen-I degradation in 3D cultures, we added to the Matrigel: 
collagen-I mix 25 pg ml” ' of quenched DQ-collagen-I (Molecular Probes). After 
degradation, DQ-collagen-I becomes fluorescent. 

To collect cells from 3D cultures, we incubated cells with Dispase (BD Bio- 
sciences) for 30-60 min at 37 °C, according to the manufacturer’s instructions. 
Cell aggregates were then trypsinized to obtain single cells suspensions and pro- 
cessed either for western blotting or to prepare chromosome spreads. 

The alveolar epithelial cell line 16HBE, provided by Alan Hall, was cultured in 

MEM (Invitrogen) supplemented with GlutaMAX, Earle’s salts, 10% of FBS and 
100 U ml ' penicillin and streptomycin**. The non-transformed keratinocyte line 
(HaCaT), primary breast fibroblasts (1492N) and skin fibroblasts (HDFs) were 
provided by John Marshall. HaCats and HDFs were cultured in DMEM supple- 
mented with 10% of FBS and 100 U ml penicillin and streptomycin, 1492N were 
cultured in 50:50 Ham’s F12:DMEM supplemented with 10% of FBS and 
100U ml ' penicillin and streptomycin We used tetracyclin-free FBS (Hyclone) 
to grow the 16HBE and HaCaTs cells expressing the PLK4 construct to inhibit 
PLK4 expression in the absence of doxycycline. 
Organotypic culture system. The organotypic culture system was adapted from 
previously described methods*”*. Briefly, 200 il of culture media containing epi- 
thelial cells or epithelial cells plus fibroblasts was added on top of the polymerized 
Matrigel:collagen-I mixture in each well. Co-culture of epithelial cells with fibro- 
blasts was carried out as follows: HaCaTs cells were grown with HDF fibroblasts 
and MCF1O0A cells were grown with human primary fibroblasts (1492N). 600 kl of 
culture media was added to the bottom of the transwell. Fibroblasts invade the 
Matrigel:collagen-I layer whereas most of the non-invasive epithelial cells grow on 
top of this layer. Medium was changed every 2 days. Cells were grown for 7 days 
before samples were fixed and stained. 

Cells in transwells were fixed in formalin overnight at room temperature. 
Inserts were removed using scalpels and placed in 70% ethanol and processed 
for histology. Paraffin embedded inserts were sectioned and stained for haema- 
toxylin and eosin (H&E). For both MCF10A and HaCaTs, the percentage invasion 
was calculated as the number of cells that entered the Matrigel:collagen-I layer 
relative to the total number of cells per each well. 

Lentiviral and retroviral vectors. For the generation of the inducible PLK4 over- 
expression system we used the lentiviral vectors pLenti-CMV-TetR-Blast (17492, 
Addgene) and pLenti-CMV/TO-Neo-Dest (17292, Addgene)*”. Wild-type PLK4 
and PLK4'°* cDNAs were cloned using the Gateway system into the pLenti- 
CMV/TO-Neo-Dest vector. Cells were first infected with a lentivirus containing 
the TetR and selected with Blasticidin (10-5 pg ml _!). After selection, cells were 
infected with the lentivirus containing the wild-type PLK4 or PLK4' © transgenes 
and selected with Geneticin (200-100 yg ml’). Note that no clones were selected 
at any point and that all the selected cells were pooled to make a population. All the 
cell lines generated were induced with 2 1g ml ' of doxycycline for 48h to induce 
the expression of the transgenes. pLKO.1 lentiviral vector containing Cep192 
shRNA hairpin sequence was obtained from the RNAi consortium (TCR) at the 
Broad Institute (5'’-CCCGG GAGGCATCAGTTAATACTGAT-CTCGAG-AT 
CAGTATTAACTGATGCCTCTITTTTTG-3’). pLKO.1 lentiviral vector expres- 
sing p53 shRNA was obtained from Addgene (19119)*°. Lentilox Centrinl-eGFP 
construct was a gift from J. Loncarek. pLenti6/V5 lentiviral vector expressing 
H2B-GFP was cloned as previously described’. The Raichu-Racl biosensor, a gift 
from M. Matsuda, was subcloned into the retroviral vector pWZL-blast using the 
restriction sites EcoR1 and Sall. 

Chemicals. Doxycycline (Sigma) was used at 21g ml *. The following doses of 
inhibitors were used: 25 uM NSC23766 (EMD Millipore), 50 ,tM CK-666 (Sigma), 
0.1 LM Reversine (Cayman Chemical), 10 mM paclitaxel (Sigma), 4 uM of dihy- 
drocytochalasin B (DCB; Sigma), 200 ng ml! doxorubicin (Sigma), and 5 uM and 
10 1M of marimastat (BB-2516; Sigma). 


2D indirect immunofluorescence microscopy. Cells plated in glass coverslips 
were washed in PBS and fixed with 4% of PFA for 15 min at room temperature 
(RT). For centriole/centrosome staining, cells were fixed with ice-cold methanol 
at —20°C for 10 min. Following fixation, cells were permeabilized with PBS and 
0.2% Triton X-100 for 5 min, blocked in PBST (PBS, 5%BSA, 0.1% Triton X-100) 
for 30 min, and then incubated with primary antibodies in PBST for 60 min. Cells 
were washed with PBS and incubated with species-specific fluorescent secondary 
antibodies (Alexa-conjugated, Molecular Probes). DNA was stained with Hoechst 
33342 (1:5,000; Invitrogen) for 5min in PBS. Coverslips were mounted with 
ProLong Antifade mounting medium (Molecular Probes). Antibodies used included 
anti «-tubulin DM1a (1:500; Sigma-Aldrich), anti-centrin2 (1:100; Santa Cruz), 
anti y-tubulin GTU88 (1:500; Sigma-Aldrich) and anti B-catenin (1:500; Abcam). 
Phalloidin was used to stain F-actin (1:250; AlexaFluor 568; Invitrogen). Images 
were collected with a Yokogawa CSU-22 spinning disk confocal mounted on a 
Zeiss Axiovert microscope using 404, 488 and 561 nm laser light. Captured images 
from each experiment were analysed with Slidebook software (Intelligent Imaging 
Innovations). Measurement of centrosome number was performed in mitotic cells, 
~100 were scored in each experiment. 

3D indirect immunofluorescence microscopy. Immunofluorescence of 3D cul- 
tures was carried out as previously described*’. Briefly, cells grown in 3D cultures 
were washed with 1X PBS and fixed in 5% of formalin (Sigma) in PBS for 20 min at 
37°C. After fixation cells were rinsed 3 times, 10min each, with PBS:glycine 
(100 mM) and permeabilized with 0.5% Triton X-100 in PBS for 10 min. Cells 
were blocked with 10% of goat serum (Sigma) in IF buffer (130 mM NaCl, 7mM 
Naz:HPOx, 3.5 mM NaH>POg,, 7.7 mM NaN3, 0.1% BSA, 0.2% Triton X-100, 0.05% 
Tween-20) for 1 h at room temperature and primary antibodies were incubated in 
the same solution over night at 4 °C. Cells were washed 3 times, 20 min each, with 
IF buffer. When required, cells were incubated with secondary antibodies for 1 h at 
room temperature (Alexa-conjugated, Molecular Probes). Cells were washed twice 
with IF buffer and once with PBS followed by incubation with Hoechst 33342 
(1:2,500; Invitrogen) for 20min. 3D cultures were then mounted in ProLong 
Antifade mounting medium (Molecular Probes). Antibodies used included anti 
a-tubulin FITC conjugated DM1a (1:100; Sigma), anti Laminin-V AlexaFluor 
488 conjugated (1:100; Millipore), anti-acetylated-tubulin 611B-1 (1:100; Sigma), 
anti-fibronectin (1:100; BD Biosciences) and anti-Pericentrin (1:100; Abcam). 
Phalloidin was used to stain F-actin (1:100; AlexaFluor 568; Invitrogen). Images 
were collected with a Nikon A1R laser scanning confocal head mounted on a Nikon 
Ti-E motorized inverted microscope using 404, 488 and 561 nm laser light. Cap- 
tured images from each experiment were analysed using NIS-Elements software 
(Nikon). 

Long-term live-cell imaging. H2B-GFP expressing cells were grown on glass- 
bottom 12-well tissue culture dishes (MatTek) for 2D imaging or in Lab-Tek cham- 
bered coverglass for 3D imaging. Cells were imaged on a Nikon TE2000-E2 inverted 
microscope equipped with a cooled CCD camera (TE2000, Orca ER, Hamamatsu; 
Ti-E, Coolsnap HQ2, Photometrics), a precision motorized stage (Bioprecision, 
Ludl), and Nikon Perfect Focus. Microscope was enclosed within temperature- 
and CO,-controlled environments that maintained an atmosphere of 37°C and 
3-5% humidified CO2. GFP and bright field images were captured at multiple points 
every 4 min for 1-2 days with either 10X (0.3 NA) or 20X (0.5, 0.75 NA) objectives. 
Captured images from each experiment were analysed using NIS-Elements software. 
Immunofluorescence quantification. Quantification of y-tubulin fluorescence 
intensity associated with the centrosomes was performed as previously described"". 
Images were acquired with a 100X NA 1.45 Plan Apo objective. Stacks of 6 images 
with 0.6 jm step size were collected. Step size was calculated to have minimal pixel 
overlapping between steps. SUM intensity projections of the images were used to 
quantify fluorescence intensity using ImageJ. Computer generated 50 x 50 and 
80 X 80 pixel regions were centred over each centrosomes (as shown in Extended 
Data Fig. 9). The intensity value measured for the 50 < 50 pixel region include both 
centrosome and background fluorescence. Background fluorescence was obtained 
by subtracting the integrated value of 50 X 50 pixel region from the larger 80 X 80 
pixel region. Integrated centrosomal fluorescence intensity was calculated by sub- 
traction of the background fluorescence intensity from total fluorescence inten- 
sity (Extended Data Fig. 9). The advantage of this approach is that it controls for the 
non-homogeneity in background fluorescence. 

2D cell-cell adhesion analysis. MCF-10A cells expressing the Raichu-Rac (here 
used just to visualize the cell membrane), with and without centrosome amplifica- 
tion, were plated on glass-bottom tissue culture dishes (MatTek) coated with 
20 wg ml~* fibronectin (Sigma-Aldrich) for 20 min. Centrosome amplification 
was induced by 48h PLK4 induction, as described above. To monitor the cell- 
cell contact dynamics, we acquired time-lapse series of images, visualizing the CFP 
moiety of the Raichu-Rac FRET reporter (Fig. 3a and Supplementary Videos 6 
and 7) at 5 min intervals over 12.4h. To illustrate differences in cell-cell contact 
dynamics, we generated kymographs (Fig. 3a). The kymograph was obtained by 
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sequentially mounting a 1 X 336 (pixel) region of interest from the videos, using 
ImageJ. Time-lapse series were collected with a Nikon inverted microscope with 
epi-fluorescence optics using a 40X plan Apo NA 1.4 objective. The microscope 
was equipped with a Nikon Perfect Focus System and a Hamamatsu ORCA ER 
cooled CCD camera and controlled with Nikon NIS-Element software. To reduce 
illumination intensity, minimizing phototoxicity and photobleaching, we used a 
ND8 (1/8 transmission) neutral density filter. 

Western blotting. Cells were collected and ressuspended in Laemmli buffer and 
proteins were separated on sodium dodecyl sulphate polyacrylamide gel electro- 
phoresis (SDS-PAGE) and transferred onto PVDF membranes. Antibodies used 
included anti «-tubulin DM1a (1:2,000; Sigma), anti-CEP192 (1:1,000; gift from L. 
Pelletier), anti- MCAK (1:1,000; Bethyl Laboratories), anti-p53 (1:1,000, Cell Signaling), 
anti-P-p53 (1:1,000, phospho-Ser15, Cell Signaling), E-cadherin (1:1,000, Invitrogen) 
and anti-Racl (1:1,000, BD Biosciences). Images were acquired using ImageQuant 
LAS4000 (GE Healthcare) and when required band intensity was quantified using 
Image]. 

siRNA. siRNA was performed using Lipofectamine RNAiMax (Invitrogen) 
according to the manufacturer’s instructions. 50 nM of CEP192, MCAK and luci- 
ferase (negative control) siRNA was used per well in a 6-well plate. After 6h of 
incubation, transfected cells were washed and normal growth medium was added. 
Cells were analysed at 48 h after transfection. ON-Target plus SMART pools were 
used for MCAK and CEP192 siRNA (Dharmacon). Human KIF2C/MCAK siRNA 
(L-004955-00-0005): GGCAUAAGCUCCUGUGAAU (J-004955); CCAACGCA 
GUAAUGGUUUA (J-004955-07); GCAAGCAACAGGUGCAAGU (J-004955- 
08); UGACUGAUCCUAUCGAAGA (J-004955-09). Human CEP192 siRNA (L- 
032250-01-0005): UGUGAAGAAAUACGAGAUA (J-032250-09); GCUCAGC 
GGUAUUUGGGAA (J-032250-10); GUCUAGAACUCGAGAAUCA (J-032250- 
11); GGUUGAAGCAGUAGAGAGU (J-032250-12). 

Single-nucleotide polymorphisms (SNP). Cells from 3D cultures were recovered 
after 4 days as described above (see Cell Culture section for details) and genomic 
DNA was prepared using DNA purification Kit (Qiagen) according to the man- 
ufacturer’s instructions. Genomic DNA was used to perform analysis with gen- 
ome-wide human SNP 6.0 arrays to determine copy number in MCF10A control 
cells, MCF10A after induction of centrosome amplification and MCF10A depleted 
of MCAK. SNP array data was analysed with the Nexus Copy Number Software 
(BioDiscovery; http://www. biodiscovery.com/software/nexus-copy-number/), using 
the ‘Matched Paired Analysis’ module. The SNP data discussed in this publication 
have been deposited in NCBI’s Gene Expression Omnibus” and are accessible through 
GEO accession number GSE55042. 

Microtubule polymerization assay. Cells plated in glass coverslips were trans- 
ferred to ice cold medium and incubated 1 h on ice to depolymerize microtubules. 
Cells were then incubated with medium at 37 °C for 30s to allow microtubules to 
polymerize and fixed immediately in ice-cold methanol for 10 min. Cells were 
stained for microtubules and centrioles (as described above) and microtubule num- 
ber was quantified manually. Images of cells used in the analysis were acquired from 
2 independent experiments. 

qRT-PCR. PLK¢ is highly unstable and we were not able to monitor its transient 
overexpression by western blotting. (RT-PCR was therefore used to analyse the 
extent of its expression. RNA was prepared using Qiagen RNAeasy kit according 
to the manufacturer’s instructions. For 3D cultures, RNA was initially harvested 
using TRIzol, and after chloroform extraction, the upper aqueous phase was used 
to purify RNA using the RNAeasy kit. 300 ng of RNA was used to produce cDNA 
using qScript CDNA SuperMix kit (Quanta Bioscience), according to the manu- 
facturer’s instructions. For RT-PCR, we used Power SYBR Green followed by 
analysis with ViiA PCR machine (Applied Biosystems). 

Primers used to assess the levels of PLK4 overexpression only amplify the exo- 
genous wild-type PLK4 or PLK4'-* sequences. The primers used for gRT-PCR 
were: PLK4 forward: 5'-CAGGATTTGCCCGGGATGGCG-3'; PLK4 reverse: 5’- 
AACCAGTGTGAATGGACTCAGCTCT-3’; GAPDH forward: 5'-TTAAAAG 
CAGCCCTGGTGAC-3’; GAPDH reverse: 5'-CTCTGCTCCTCCTGTTCGAC-3’. 
Racl-GTP pull-down. The Racl-GTP pull-down assay was performed using the 
Racl activation kit (Cytoskeleton) according to the manufacturer’s instructions. 
We used cells plated in one 10 cm dish per assay. Cells were resuspended in 400 jl 
of lysis buffer and 15 l of CRIB/PBD beads were used to pull-down active Racl. 
Extracts were incubated with the beads for 30 min. All the procedures were done at 
4°C and buffers were kept ice-cold. After washing, beads were resuspended in 
15 pl of Laemmli buffer and processed for western blotting. To inhibit microtubule 
dynamics, cells were treated with 10 mM of paclitaxel for 1 h’*. 

Generation of tetraploid cells with normal centrosome number. MCF10A cells 
were treated with 4 uM DCD for ~18 h, washed every 5 min over 30 min, and then 
FACS sorted by DNA content using Hoechst at 1:2,500 (Molecular Probes) to 
isolate tetraploid cells with extra centrosomes. To isolate tetraploid cells with 
normal centrosome number, cells with a DNA content of 8c (dividing tetraploid 
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cells) were isolated and cultured for ~8 days before a second FACS sorting to re- 
isolate 8c cells. By sort 4, nearly 100% of tetraploid cells (as assessed by FACS and 
karyotyping) had two centrosomes’. 

Chromosome spreads. MCF1OA cells were treated with 20ng ml‘ colcemid 
(Gibco) for 4h, trypsinized, resuspended in 75mM of KCl and incubated for 
30 min at 37°C. Cells were then fixed with 3:1 ice-cold methanol:acetic acid, 
(Carnoy’s solution) pelleted, and then washed three times more with methanol: 
acetic acid before being dropped on a pre-cleaned glass side. Cells were allowed to 
dry on the slide and were then stained for 3 min with Giemsa stain in 1X Gurt’s 
buffer (Gibco). Following a wash in Gurr’s buffer, coverslips were added to slides 
and sealed with Permount (Fisher). Images of spreads were taken with a 100 
objective on a Zeiss upright microscope and chromosomes were counted manually 
using Adobe Photoshop. The chi squared test of independence for nominal vari- 
ables was used to calculate the P value. 

FACS. Cells were fixed with 70% ethanol at 4°C followed by incubation with 
250 ug ml’ RNaseA and 10 yg ml’ propidium iodide (Invitrogen) at 37 °C for 
30 min. FACS analysis was performed with a FACSCalibur flow cytometer (Becton 
Dickinson) and data analysed with CellQuest software. 

Micropatterning. Glass coverslip micropatterning was performed as previously 
described**. Coverslips were first spin-coated with adhesion promoter Ti Prime 
(MicroChemicals) and then with 1% polystyrene in toluene at 3,000r.p.m. 
Polystyrene coated coverslips were oxidized through oxygen plasma (FEMTO; 
Diener Electronics) for 15s at 30 W before incubating with 0.1mgml~' PLL- 
PEG in 10mM HEPES pH 7.4 for 30 min. After drying, coverslips were exposed 
to deep ultraviolet (UV) (UVO cleaner, Jelight) through a photomask (TOPPAN) 
for 5 min. Right after UV activation, coverslips were incubated with 20 pg ml of 
fibronectin (Sigma), and 20 jig m1“ of fluorescent fibrinogen conjugate (Invitrogen) 
solution in PBS for 30 min. Coverslips were washed 3 times with sterile PBS before 
plating cells. Approximately 0.5 million cells were seeded onto micropatterned 
chips and were washed after 30 min to remove non-attached cells, which almost 
always resulted in single cells per micropattern. Cells were allowed to divide for 
16h prior to fixation. For Racl and Arp2/3 inhibition, cells were treated with 
NSC27633 (25 [tM) or CK-666 (50 1M), respectively, for 6 h, before fixation. For 
CEP192 siRNA, cells were plated after 48h of siRNA treatment. Images of cells 
used in the analysis were acquired from 2 independent experiments. The hourglass 
micropattern guides intracellular junction positioning as shown in Figure 3b, 
where most of the junction extremities are positioned in the region deprived of 
extracellular matrix (fibronectin). We measure the angle and size of the junctions 
to assess junction position and length in the different conditions. 

Note that when the cells divide on these patterns, spindle orientation is com- 
pletely random with respect to the geometry of the pattern. The cells also move 
after division before establishing normal cell-cell contacts, further altering the 
position of the prior division site relative to the site of eventual cell-cell contact. 
Thus, the final adhesion pattern is completely random relative to the initial (and 
random) orientation of the spindle, as previously described”. 

Racl FRET. For FRET experiments we used MCF10A cells stably expressing 
Raichu-Rac’*. Centrosome amplification was induced with Dox for 48h and 
~10,000 cells per ml of control and extra-centrosomes cells were plated onto 
20 ug ml fibronectin (Sigma-Aldrich) coated glass-bottom dishes/plates (MatTek) 
for 6h. For EGF depletion experiments cells incubated without EGF for 15h. 
FRET acquisition. Before acquisition, to increase the signal to noise, the culture 
medium was exchanged with 199 medium (Life Technologies) without serum and 
phenol red. For FRET imaging, cells were excited using a Nikon Intensilight source 
with a 430/24nm (for CFP) excitation filter with a dual band pass (ECFP/EYFP 
#89002) dichroic mirror and 470/24 (for CFP) and 535/40 (for FRET and YFP) 
emission filters from Chroma (Bellows Falls, VT). To reduce illumination intens- 
ity and thus minimize phototoxicity and photobleaching, we used a ND8 (1/8 
transmission) neutral density filter. All images were collected with Nikon Ti 
inverted microscope with epi-fluorescence optics equipped with a X40 plan 
Apo NA 1.4 object lens, perfect focus system and a Hamamatsu ORCA ER cooled 
CCD camera controlled with Nikon NIS-Element software. 

The following steps were taken to calculate the average magnitude and spatial 
variation of the camera’s noise for image correction of the acquired channels (CFP, 
FRET): (1) a sequence of 10 dark-current images were taken with 600 ms exposure 
time and 4 X 4 binning as the experimental images, but with no light incident on the 
CCD; (2) a sequence of 10 shade/illumination correction images for each channel 
(CFP and FRET) were taken with 600 ms exposure time and 4 X 4 binning as the 
experimental images, but acquired on a blank area without any objects. The average 
dark-current and shade images were calculated by averaging the acquired 10 images 
by using ImageJ. Single channel images (CFP, FRET) for each field view visualizing 
maximum four non-contacting cells were collected by using an exposure time of 
600 ms, 4 X 4 binning, a ND8 (1/8 transmission) neutral density filter and illumina- 
tion light shuttered between acquisitions. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


FRET analysis. The Biosensor software from Dr. Gaudenz Danuser’s laboratory 
(http://lccb.hms.harvard.edu/index.html) along with a series of automated ImageJ 
macros were used to calculate the FRET ratio images. The acquired CFP and FRET 
images were corrected for the average dark-current, shading and background 
subtracted by using Biosensor or ImageJ. Then, single-cell images from each 
channel (CFP and FRET) were segmented by using the ImageJ minimum and 
mean threshold method, and the relative regions of interest (ROIs) were recorded. 
Single cell mask images having background and foreground (cell image) pixel 
values equal to zero and one, respectively, were calculated from each cell ROI 
by ImageJ. To set the image background pixels to zero, CFP and FRET images were 
multiplied by the corresponding calculated image masks. Finally the FRET ratio 
images were obtained by dividing the processed FRET image by the CFP image. 

Single cell average FRET/CFP values representing the Racl activation levels 
were obtained by calculating the mean pixel value of the FRET ratio images for 
each ROI (single cell). FRET/CFP values were collected from 2 independent exper- 
iments. An unpaired two-tailed Student’s t-test statistical analysis was used to obtain 
level of significance between different experiment conditions. 
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Extended Data Figure 1 | Characterization of cells after transient 
overexpression of PLK4 or PLK4’~®, a, Scheme of the experimental design 
to induce centrosome amplification. Transient overexpression of PLK4 and 
PLK4'~®°8 was achieved by addition of Dox for 48h (2D culture) followed by 
removal of Dox and growth in 3D culture for 4 days in the absence of Dox. 
b, qRT-PCR showing the levels of induction of the PLK4'-°°* and wild-type 
PLK4 transgenes in cells after 48h of Dox. Error bars represent mean + s.e. 
from 3 independent experiments. c, RT-PCR showing the expression of 
PLK4 after 48 h of Dox (2D) and after 4 days in 3D cultures (3D). Note that 
PLK4 overexpression after 4 days in 3D cultures is down to control 

levels after Dox removal. Error bars represent mean ~ s.e. from 3 independent 
experiments. d, Proliferation curve of cells after induction of PLK4 and 
PLK4~® over 72 h. Centrosome amplification decreases cell proliferation. 
Error bars represent mean + s.e. from 3 independent experiments. e, The 
fraction of cells with centrosome amplification at the indicated time points 
after PLK4 induction. Note that, because centrosome number is quantified in 
mitotic cells, this result demonstrates that cells with extra centrosomes can 
enter mitosis even after 72 h of Dox treatment. Error bars represent mean + s.e. 
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from 3 independent experiments. f, Fraction of cells with centrosome 
amplification in an independently generated MCF10A.PLK4 cell line. Error 
bars represent mean = s.e. from 3 independent experiments. g, Corresponding 
fraction of invasive acini in 3D cultures. Error bars represent mean + s.e. 
from 3 independent experiments. h, Centrosome amplification (PLK4 
overexpression, +Dox) in non-transformed keratinocytes (HaCaTs) promotes 
invasion in the organotypic culture model. Images show H&E staining of 
sections of HaCaT’ cells. Black arrows indicate cells invading the 

matrix. Note that the invasion of groups of cells was only detected in the 
+Dox condition (black arrowhead). Scale bar, 100 um. Graph shows 
quantification of the percentage of cells that invade. Each dot in the graphic 
represents the percentage of invasion in each individual well per 

experiment analysed. The P value was derived from unpaired two-tailed 
Student’s t-test *P < 0.05). i, Collagen-I degradation induced by 

centrosome amplification (green). Collagen degradation is visualized by 
DQ-Col-I, which becomes fluorescent after degradation because of fluorophore 
dequenching. Scale bar, 10 um. See Methods for more details. 
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Extended Data Figure 2 | Characterization of evolved diploid and tetraploid 
cells. a, Scheme of the experimental design to obtain fresh MCF10A 
tetraploid cells with extra centrosomes (4N) and ‘evolved’ tetraploid cells that 
lost the extra centrosomes (4N.evo), as previously described’. b, FACS 
profiles of ‘evolved’ diploid (2N.evo) and tetraploid cells (4N.evo). c, Western 
blotting to detect E-cadherin in the ‘evolved’ cells indicates that 4N.evo 
maintain epithelial characteristics. d, Representative images of metaphase 
chromosome spreads of 2N.evo and 4N.evo and quantification of chromosome 
number by karyotyping (~30 chromosome spreads were quantified in each 
condition). 4N.evo cells have a near-tetraploid karyotype. e, Centrosome 
amplification in diploid cells (2N or 2N.evo) newly generated tetraploid cells 
(4N) and evolved tetraploid cells (4N.evo). f, Quantification of the 


percentage of aneuploid cells in the ‘evolved’ cells. The 4N.evo cells are 
aneuploidy despite their near-tetraploid genomes (~30 chromosome spreads 
were quantified in each condition). g, Quantification of centrosome 
amplification of 4N.evo cells overexpressing PLK4. Error bars represent 
mean + s.e. from 3 independent experiments. h, Quantification of the invasive 
acini in 4N.evo cells after PLK4 overexpression. This experiment serves as a 
control to demonstrate that the 4N.evo cells retain their ability to amplify 
centrosomes and, after centrosome amplification, retain the capacity to form 
invasive acini. Error bars represent mean + s.e. from 3 independent 
experiments. P value derived from unpaired two-tailed Student’s t-test 

(**P < 0,005). 
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Extended Data Figure 3 | Characterization of invasive structures in cells 
with extra centrosomes. a, F-actin and microtubules in invasive protrusions: 
F-actin (red), microtubules («-tubulin, green) and DNA (blue). Insets show 
higher magnification images of the invasive protrusions. Scale bar, 10 1m. 

b, Fibronectin at invasive protrusions: cells were stained for F-actin (red), 
fibronectin (green) and DNA (blue). Scale bar, 10 um. ¢, Fraction of invasive 
acini in 3D cultures after treatment with the broad spectrum matrix 
metalloprotease (MMP) inhibitor, marimastat (BB-2516). Error bars represent 
mean = s.e. from 3 independent experiments. P value derived from unpaired 
two-tailed Student’s t-test (**P < 0.005). d, Images from videos of PLK4 
overexpression cells (Supplementary Video 3), showing nuclei (labelled with 
H2B-GFP) migrating into an invasive protrusion (red arrows). Time scale 
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shown as h:min. Scale bar, 20 um. e, Multiple cells can migrate into invasive 
protrusions. Cells were stained for F-actin (red), laminin-V (green) and 
DNA (blue). Red arrows mark cells that migrated into the invasive 
protrusion. Scale bar, 10 um. f, Western blot showing levels of E-cadherin, 
N-cadherin and vimentin in cells with (+ Dox) and without (— Dox) extra 
centrosomes before and after 4 days in 3D culture. The western blots 

show that, unlike cells treated with TGF-f, cells with extra centrosomes do not 
acquire a canonical epithelial-mesenchymal transition (EMT) phenotype. 
We do note a small increase in the levels of vimentin in cells with extra 
centrosomes before plating in 3D cultures. Dox treatment was given for 48h 
before 3D cultures in all experiments. 
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Extended Data Figure 4 | Similarity between cells with centrosome by cells overexpressing ERBB2, as previously reported*™. In both conditions, 
amplification and cells with oncogene-induced invasion. Cells were stained _ invasive protrusions are characterized by the formation of actin-rich 
for F-actin (red), laminin-V (green) and DNA (blue). Similarity between protrusions that are accompanied by degradation of the basement membrane. 


the invasive protrusions of cells with extra centrosomes and the ones generated —_ Scale bar, 10 um. 
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Extended Data Figure 5 | Reversine treatment induces aneuploidy but not _ treatment does not induce invasion. Error bars represent mean + s.e. from 3 


invasive acini in MCF10A cells. a, Quantification of the chromosome independent experiments. c, Images of 3D cultures after treatment with 
number in cells after treatment with with 0.1 1M of reversine (Rev) for 24h reversine showing normal appearing acini. Scale bar, 50 tm. d, SNP 

before and after 4 days in 3D cultures (~20 chromosome spreads were analysis of MCF10A cells (without centrosome amplification) compared 
quantified in each condition). The concentration of reversine used does not with Human Reference Genomic DNA 103 from Affymetrix. Previously 
induce cytokinesis failure and therefore would not induce centrosome reported genomic alterations in MCF10A cells can be detected in our analysis, 
amplification by inducing tetraploidy. b, Fraction of invasive acini after namely: +5q, +6p and +8q. 


MCF1O0A cells are treated with reversine. Increased aneuploidy from reversine 
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Extended Data Figure 6 | Invasive protrusions from 3D cultures of 
MCFI1O0A cells with extra centrosomes are not an indirect consequence of 
altered cilia signalling, increased p53 expression or defects in centrosome 
polarization. a, Cells in 2D were stained for pericentrin (green, inset), 
acetylated tubulin (red, inset) and DNA (blue). Cells were arrested for 48h in 
G1 to induce primary cilium formation. Note that even in this case most of the 
cells do not form cilia. This is expected because MCF10A cells have limited 
proficiency for cilia formation, with only ~7% of the cells assembling cilia even 
after 7 days of serum starvation”. Cells in 3D were stained for centrin (GFP, 
green inset), acetylated tubulin (red, inset) and DNA (blue). Cells do not form 
cilia after 4 days in 3D cultures. This is expected because, unlike MDCK cells, 
MCFIOA cells do not have a discernable apical polarity and lumen after 4 days 
in 3D cultures and thus are unlikely to form primary cilium at this time. b, As 
expected’’, centrosome amplification in MCF10A cells induces modest p53 
activation. Note that this degree of p53 activation has a minor effect on the 
proliferation of MCF10A cells (Extended Data Figure 1d). Expression of PLK4 
and PLK4!~©° was induced by Dox for the indicated times: 0, 24, 48 and 72h. 
c, Western blotting showing the levels of induction of p53 after doxorubicin 
treatment (200 ng ml’ for 4h) in control and p53-depleted cells, 
demonstrating that the p53 shRNA efficiently prevents p53 activation. 


d, Fraction of acini with invasive protrusions from cells with (+ Dox) or 
without (—Dox) centrosome amplification after depletion of p53. The ability of 
cells to form invasive acini is not significantly affected by their p53 status. Error 
bars represent mean + s.e. from 3 independent experiments. P values were 
derived from unpaired two-tailed Student’s t-test (***P < 0.0005; *P < 0.05). 
e, Cells were stained for «-catenin (green), y-tubulin (red), DNA (blue) and the 
fibronectin micro-pattern visualized in red. Dashed boxes outline the 
centrosomes. Note that after PLK4 overexpression, extra centrosomes 
(clustered in interphase) are correctly positioned towards the cell-cell junction 
(similar to the control) even when the junction is defective, suggesting that 
centrosome amplification is not impairing the polarity axis of these cells. 

f, Centrosome positioning is not altered in cells with (+ Dox) and without 
(control) extra centrosomes. Left image shows representative images of cells 
showing centrosomes (y-tubulin) in relation to adherens junctions 
(a-catenin). Right image shows scheme with quantification of the fraction of 
centrosomes at the indicated positions on the micropatterns (see Fig. 3a). 
Note that the position of centrosomes in cells with centrosome 

amplification does not differ from that in control cells. Cells were plated 

on the patterns 48 h after induction of centrosome amplification with PLK4. 
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Extended Data Figure 7 | Centrosome amplification induces cell scattering 
and Rac activation. a, Quantification of number of cells with (+Dox) or 
without (—Dox) extra centrosomes that remain as pairs within 10h time after 
mitosis. For -Dox n = 180; +Dox n = 98. Cell scattering occurred in most 
of the cases within the first 2h after mitosis. Cells were imaged on 2D 
substratum. Similar results were obtained with fixed cells. Error bars represent 
mean + s.e. from 3 independent experiments. b, Still images from videos 
showing examples of cells that stay together (— Dox) or move apart (+ Dox). 
Time scale shown as h:min. c, Western blot showing levels of p120 catenin in 
cells with (+Dox) and without (—Dox) extra centrosomes in 2D and 3D 
cultures. d, Western blot from a pull-down experiment to detect GTP-bound 
Racl in HaCaTs cells. Graph shows quantification of active Racl from pull- 
down experiments. Error bars represent mean = s.e. from 3 independent 
experiments. e, Western blot from a pull-down experiment to detect 
GTP-bound Racl in 16HBE cells. Graph shows quantification of active Racl 
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from pull-down experiments. Error bars represent mean = s.e. from 2 
independent experiments. f, FRET control demonstrating increased CFP 
emission after photobleaching of the YFP fluorophore at an excitation 
wavelength of 510 nm for 10 min in MCF-10A single cells expressing Raichu- 
Rac. g, Levels of active Racl measured by FRET in cells overexpression 
PLK4!-©®. For —Dox, n = 25; +Dox, n = 22. Error bars represent mean = s.e. 
h, Western blot from a pull-down experiment to detect GTP-bound RhoA in 
MCFI1O0A cells showing decrease RhoA activity in cells with extra centrosomes. 
i, Levels of active Racl measured by FRET in cells with extra centrosomes 
treated with the Racl inhibitor NSC23766, demonstrating that NSC23766 
inhibits Racl activation in cells with extra centrosomes. For —NSC23766, 

n = 37; +NSC23766, n = 36. Error bars represent mean = s.e. All the P values 
were derived from unpaired two-tailed Student’s t-test (***P < 0.0005; 

**P < 0,005; *P < 0.05). Scale bar, 10 tm. 
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Extended Data Figure 8 | Cell-cell adhesion defects caused by centrosome 
amplification can be observed in tetraploid cells and can be suppressed by 
Arp2/3 complex inhibition. a, Western blot from a pull-down experiment 
to detect GTP-bound Rac] in tetraploid MCF10A cells. b, Distribution of the 
cell-cell junction angles (left) and size (right) in the indicated tetraploid cells, 
with or without treatment with the Racl inhibitor, NSC23766. Note that 
tetraploid cells with extra centrosomes (4N) have a striking defect in junction 
positioning by comparison with tetraploid cells with normal centrosome 
number (4N.evo). This severe phenotype is only partially rescued by Racl 
inhibition. For 4N.evo, n = 106; 4N, n= 70; NCS23766, n= 47. Error bars 
represent mean + s.e. c, Examples of cell doublets with (+Dox) or without 


Dox: - + + 
CK-666: - - + 


-Dox 


+Dox 


+ CK-666 


(—Dox) centrosome amplification on the fibronectin micro-patterns. Cells 
were stained for F-actin (red), B-catenin (green), DNA (blue). d, Examples of 
cell doublets with (+ Dox) or without (—Dox) centrosome amplification 

on the fibronectin micro-patterns treated with the Arp2/3 inhibitor (CK-666). 
Cells were stained for F-actin (red), B-catenin (green), DNA (blue). 

e, Distribution of the junction angle and quantification of the junction size in 
cells with extra centrosomes treated with 50 1M of Arp2/3 inhibitor (CK-666) 
for 6h. Cells were analysed 48 h after Dox treatment. For -Dox, n = 251; 
+Dox, n = 160; CK666, n = 168. Error bars represent mean = s.e. All the 

P values were derived from unpaired Student’s two-tailed t-test 

(***P < 0.0005; **P = 0.005). Scale bar, 10 pum. 
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Extended Data Figure 9 | Quantification of centrosomal y-tubulin in cells 
with extra centrosomes and levels of active Racl in the absence of EGF. 

a, Images of centrosomes from interphase MCF10A cells stained for y-tubulin. 
Boxes represent the region for measurement of centrosomal y-tubulin signal 
(inside) and background (area between inner and outer boxes). b, Method used 
to determine the integrated fluorescent intensity of centrosomal y-tubulin, as 
previously described*'. c, Measurement of y-tubulin intensity at the 
centrosomes showing that in interphase, increased centriole number is 
sufficient to increase y-tubulin levels at the centrosomes whereas increased 
ploidy per se does not (4Nevo). For PLK4 -Dox, n = 60; PLK4 +Dox, n = 49; 
4N.evo. PLK4 -Dox, n = 34; 4N.evo. PLK4 + Dox, n = 35. Error bars represent 
mean + s.e. d, Increased Racl activity in cells with extra centrosomes can be 
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detected in arrested cells deprived of EGF. Quantification of Racl 

activity by FRET in single cells with (+Dox) and without (— Dox) extra 
centrosomes in the absence of EGF, and examples of FRET images of cells in the 
absence of EGF. For PLK4 —Dox, n = 36; PLK4 + Dox, n = 47. Error bars 
represent mean + s.e. Scale bar, 10 um. e, FACS profiles of control (— Dox) and 
cells with extra centrosomes (+Dox) after 48 h of Dox treatment showing that 
there is not major difference in the cell cycle profiles of these cells. Note that 
at this time point centrosome amplification does not produce a 

striking defect in cell proliferation (Extended Data Fig. 1d). All the P values 
were derived from unpaired two-tailed Student’s t-test (**P < 0.005; 

**D << (),0005). 
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Extended Data Figure 10 | Depletion of CEP192 suppresses the invasive 
properties of cells with centrosome amplification. a, Scheme of the 
experimental design to induce centrosome amplification in cells depleted of 
CEP192 by siRNA. Transient overexpression of PLK4 is induced 6h after 
siRNA to allow efficient centrosome overduplication. As expected, after 
depletion of CEP192 for 48 h, cells are partially compromised in their ability to 
overduplicate centrosomes after PLK4 overexpression*®. b, Western blot 
showing efficient depletion of CEP192 after 48 h treatment of cells with CEP192 
siRNA. c, Western blot showing partial depletion of CEP192 by shRNA. 

d, Quantification of centrosomal y-tubulin after depletion of CEP192 by siRNA 
for 48 h. Similar results were observed with CEP192 esiRNA (not shown). It is 
notable that at least for a three-day period, cells remain viable after CEP192 
knockdown. For control siRNA, n = 22; CEP192 siRNA, n = 20. Error bars 
represent mean + s.e. Quantification of centrosome amplification after 
depletion of CEP192 by siRNA (e) or shRNA (f). Error bars represent 
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mean ~ s.e. from 3 independent experiments. g, Bright field images of acini 
after 4 days in 3D culture, demonstrating that partial CEP192 depletion by 
shRNA does not significantly impair cell growth or the formation of acini. Red 
arrows indicate the invasive acini. h, Quantification of PLK4-mediated 
centrosome amplification in cells depleted of CEP192 after 4 days 

in 3D cultures showing that these cells still carry extra centrosomes. Error bars 
represent mean + s.e. from 3 independent experiments. Images show 

normal acini displaying centrosome amplification after partial knockdown of 
CEP192. Cells were stained for F-actin (red), centrioles (centrin1-GFP, 
green) and DNA (blue). Scale bar, 10 jum. i, Levels of active Racl measured 
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Membrane proteins bind lipids selectively to 
modulate their structure and function 


Arthur Laganowsky’*, Eamonn Reading'*, Timothy M. Allison', Martin B. Ulmschneider?, Matteo T. Degiacomi’, 


Andrew J. Baldwin! & Carol V. Robinson! 


Previous studies have established that the folding, structure and func- 
tion of membrane proteins are influenced by their lipid environments'” 
and that lipids can bind to specific sites, for example, in potassium 
channels*. Fundamental questions remain however regarding the 
extent of membrane protein selectivity towards lipids. Here we report 
a mass spectrometry approach designed to determine the selectivity 
of lipid binding to membrane protein complexes. We investigate the 
mechanosensitive channel of large conductance (MscL) from Myco- 
bacterium tuberculosis and aquaporin Z (AqpZ) and the ammonia 
channel (AmtB) from Escherichia coli, using ion mobility mass spec- 
trometry (IM-MS), which reports gas-phase collision cross-sections. 
We demonstrate that folded conformations of membrane protein 
complexes can exist in the gas phase. By resolving lipid-bound states, 
we then rank bound lipids on the basis of their ability to resist gas 
phase unfolding and thereby stabilize membrane protein structure. 
Lipids bind non-selectively and with high avidity to MscL, all impart- 
ing comparable stability; however, the highest-ranking lipid is phos- 
phatidylinositol phosphate, in line with its proposed functional role 
in mechanosensation’. AqpZ is also stabilized by many lipids, with 
cardiolipin imparting the most significant resistance to unfolding. 
Subsequently, through functional assays we show that cardiolipin 
modulates AqpZ function. Similar experiments identify AmtB as 
being highly selective for phosphatidylglycerol, prompting us to 
obtain an X-ray structure in this lipid membrane-like environment. 
The 2.3 A resolution structure, when compared with others obtained 
without lipid bound, reveals distinct conformational changes that 
re-position AmtB residues to interact with the lipid bilayer. Our 
results demonstrate that resistance to unfolding correlates with spe- 
cific lipid-binding events, enabling a distinction to be made between 
lipids that merely bind from those that modulate membrane protein 
structure and/or function. We anticipate that these findings will be 
important not only for defining the selectivity of membrane pro- 
teins towards lipids, but also for understanding the role of lipids in 
modulating protein function or drug binding. 

Three membrane protein complexes were selected to give a range of 
topologies, oligomeric states and anticipated selectivity towards lipids: 
(1) the pentameric MscL from M. tuberculosis with two transmembrane 
helices per subunit and an intimate relationship with lipids'®”’; (2) the 
tetrameric water efflux channel AqpZ from E. coli with six transmem- 
brane helices for which associated lipid or detergent molecules have 
been revealed in crystal structures and in related homologues’*"’; and 
(3) the trimeric AmtB from E. coli, with eleven transmembrane helices 
involved in the transport of ammonia, for which no lipid binding has 
been observed to date in crystal structures™. 

Our first objective for studying membrane protein and lipid inter- 
actions by mass spectrometry was to maintain the folded state of these 
multimeric assemblies in the gas phase, devoid of detergent from the 
micelles required for their introduction", such that stabilization by lipid 
binding could be deduced (Fig. 1a). This had not been possible previ- 
ously as the conditions needed to disrupt the micelle perturbed the folded 


state of membrane protein complexes. We screened non-ionic detergents’® 
and employed ion mobility mass spectrometry (IM-MS) (Supplemen- 
tary Video 1), a technique that measures the rotationally averaged col- 
lision cross-sections (CCS) of protein complexes in the gas-phase’. We 
found that in detergents commonly used for structural studies (n-dodecyl- 
B-p-maltoside (DDM), octyl glucose neopentyl glycol (OGNG) and 
n-nonyl-B-D-glucopyranoside (NG)), the CCS ofall charge states were 
substantially greater when emerged from the micelle than those calcu- 
lated for the crystal structure, indicative of gas-phase unfolding (Extended 
Data Fig. 1). Interestingly, for tetraethylene glycol monooctyl ether (CgE4) 
lower average charge states were observed with CCS in agreement with 
those calculated for the crystal structures, thus providing optimal con- 
ditions for mass spectrometry of intact membrane protein complexes. 

To assess individual lipid-binding events, we prepared a series of 
synthetic and natural lipid solutions and added these to protein com- 
plexes in CgE, (Extended Data Fig. 2). MscL was tested first, revealing 
well-resolved mass spectral peaks corresponding to populations of MscL 
in complex with one to five lipids (Fig. 1b and Extended Data Fig. 3). 
Unlike other biophysical approaches, which report an average of all 
species in solution, whether bound or unbound, discrete lipid-bound 
states can be resolved and interrogated individually, allowing us to obtain 
CCS for individual lipid-binding events and to assess their effects on 
stability. Maintaining folded structures of these lipid-bound complexes 
enabled us to explore molecular dynamics simulations combined with 
filtering using CCS measurements’ to locate probable lipid-binding sites 
(Extended Data Figs 4 and 5 and Supplementary Discussion). We then 
applied collision-induced unfolding, analogous to that used previously 
for studying ligand binding to soluble complexes”’. An increase in CCS, 
from the trend line calculated for the folded state, is observed with increa- 
sing collision voltage and is assigned to highly extended conformers 
(Fig. 1b and Supplementary Video 2). All species, apo and one to five 
phospholipid-bound forms, are measured within the same experiment 
enabling a direct comparison, and show less unfolding of native pen- 
tameric states as a function of lipid binding. 

To extract quantitative values for the effects of lipid binding, we applied 
an equilibrium unfolding model, used extensively in solution studies 
involving chemical denaturation”®”’. Plotting CCS against collision volt- 
age reveals unfolding trajectories from the native state, through interme- 
diates, to extended forms. Importantly, the oligomeric state is maintained 
throughout the unfolding experiment (Extended Data Fig. 6 and Fig. 1c). 
We identified four distinct intermediate states in the unfolding trajec- 
tories and by comparing the apo form with lipid-bound states we cal- 
culated the stabilization imparted at the transitions by each lipid (see 
Methods). The stabilization computed for bound lipids allows us to 
compare and rank seven different lipids, as well as synthetic phospha- 
tidylcholine (PC) lipids with chain lengths varying from C14 to C24. 
Despite differences in these lipid structures, we found that all stabilized 
MscL to a similar degree, consistent with a previous fluorescence study 
with engineered Trp residues” (Fig. 1d and Extended Data Fig. 3b,c). 
Wealso found that MscL avidly bound phosphatidylinositol phosphate 
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Figure 1 | The mechanosensitive channel of large conductance (MscL) 
resists unfolding in the presence of lipids. a, MscL in electrospray droplets, 
within a lipid-detergent micelle, undergoes desolvation and activation. b, Mass 
spectrum reveals several MscL phosphatidylinositol phosphate (PI) bound 
states. lon mobility (top) of MscL(PI)o_s with a trend line for native state 
(dotted). c, Plot of collision voltage against CCS for +12 ions of apo and 


(PI), more strongly than all other lipids investigated, conferring a large 
linear increase in stability upon binding many PI molecules. 

We applied a similar procedure to AqpZ, revealing well-resolved lipid 
binding and a CCS in agreement with that of the crystal structure of 
AqpZ (Fig. 2a and Extended Data Fig. 7). Monitoring unfolding trajec- 
tories for lipid-bound states of AqpZ showed similar transition points 
between the different states that were largely independent of the bound 
lipid. This implies that the lipids investigated had similar effects on resis- 
tance to unfolding (Fig. 2b and Supplementary Video 3). As successive 
lipids are added, from one to three, stabilization is increased via regular 
increments consistent with a linear, cumulative effect. The only exception 
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Figure 2 | AqpZ is indiscriminately stabilized by lipids with the exception 
of cardiolipin, a lipid that stabilizes the channel significantly and modulates 
its function. a, Mass spectra reveal AqpZ(POPC)o_s resolved at 60 V with 
IM-MS arrival times in agreement with the CCS calculated from the crystal 
structure. b, Stabilization of AqpZ (+13) bound to lipids, calculated from 
unfolding parameters. Shown as described in Fig. 1d. One-way ANOVA 
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MscL(PI)4. Experimental and modelled unfolding plots with collision voltages 
at which transitions occur and CCS values (horizontal and vertical arrows, 
respectively). R values are provided. d, Stabilization calculated from 
parameters defined by fitting MscL (+ 12) with lipids. Reported are average and 
s.e.m. from repeated measurements (n = 3) in units of electron volts (eV). 


to this is cardiolipin (CDL), for which binding resulted in a substantial 
increase in protein stability compared to all other lipids. Intrigued by 
this observation, we performed functional assays” of the channel recon- 
stituted into liposomes of varying lipid composition. Water permeability 
assays of AqpZ reconstituted in a total polar lipid extract from a CDL- 
deficient E. coli strain** (BKT22) showed a striking reduction in water 
transport compared to the channel reconstituted in wild-type E. coli 
polar lipid preparations (EPL) (Fig. 2c and Extended Data Fig. 8). More- 
over, reconstituting AqpZ in a lipid preparation from strain BKT22 
expressing cardiolipin synthase genes”, restores CDL to near wild-type 
levels (mole percentage of 6.6 versus 9.1), and results in water transport 
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(n = 3), **P <0.01.¢, Water permeability assay for AqpZ reconstituted in total 
polar lipid extracts from E. coli (EPL) (orange), or a cardiolipin-deficient 
strain** (BKT22) (green), or BKT22 cells expressing ClsC and YmaB to restore 
cardiolipin (BKT22-YC) (cyan), compared with empty EPL liposomes (pink) 
(Extended Data Fig. 8). Rate constants of water transport (kwat) and standard 
error of replicates (n = 5). 
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activity comparable to in EPL (Fig. 2c). These results clearly demon- 
strate CDL modulates the function of AqpZ. 

Analogous IM-MS experiments carried out on AmtB show that resis- 
tance to gas-phase unfolding following binding of anionic lipids, phos- 
phatidic acid (PA) and phosphatidylserine (PS), is not correlated with 
significant stabilization of the channel (Fig. 3). Zwitterionic lipids, PC 
and phosphatidylethanolamine (PE), confer only moderate stabilization. 
By contrast, addition of CDL or phosphatidylglycerol (PG) results in 
striking increases in stabilization with cumulative binding of PG increa- 
sing protein stability linearly (Fig. 3b, Extended Data Fig. 7 and Sup- 
plementary Video 4). These results therefore reveal a defined selectivity 
for PG-like headgroups. 

To understand the origin of this observed selectivity, we crystallized 
Am#tB in the presence of PG. A screen produced X-ray-grade crystals 
which were used for structure determination at 2.3 A resolution (Extended 
Data Table 1). The structure of AmtB is composed of three protein sub- 
units, each comprising of 11 transmembrane helices”, with two trimers 
located in the asymmetric unit (Fig. 4). We resolved eight PG molecules 
in the electron density on the periplasmic side of AmtB (Extended Data 
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Figure 3 | Lipid binding to AmtB results in a range of stabilizing effects. 
a, Ion mobility mass spectra of AmtB(PG)o_4 molecules at 175 V with IM-MS 
unfolding data as described in Fig. 1b. Arrival times for the various charge states 
reveal non-native, trimeric structures. b, Stabilization of AmtB (+15) bound 
to different lipids and AmtBN74/N794 (4.15), a mutant form designed to 
disrupt the specific PG-lipid binding site. Gas-phase unfolding reveals that 
AmtB is weakly stabilized by most lipids but CDL and PG confer significant 
stabilization. AmtBN’*4/X’* shows a reduction in stabilization by PG 
compared with the wild-type protein. Results are shown as in Fig. 1d with 
*P<0.05. 
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Fig. 9), a potential ninth PG molecule having ambiguous electron density. 
Two of the PG molecules are located in similar positions at the subunit 
interface, with headgroups not in contact with the protein (Fig. 4b). 
The overall structure is similar to the 20 previously reported in the Pro- 
tein Data Bank (PDB) determined from detergent-only environments. 
Major differences arise from a distinct conformational change in a loop, 
residues 70-81, forming a specific lipid-binding site (Fig. 4c, Supplemen- 
tary Video 5 and Supplementary Discussion). PG forms hydrogen bonds 
to N72 and E70 and a water bridge to N79. F75 flips downward to 
interact with a lipid tail of PG. In addition, several residues are reposi- 
tioned to interact with the phospholipid bilayer. L81 and 178 shift into 
the hydrophobic environment of the lipid bilayer. Unexpectedly, W80 
moves almost 4 A (Co. to Ca), its side-chain rotated 160° compared to 
other structures. This rotation relocates the W80 side-chain from a pro- 
tein environment into one that is ideally positioned to interact with head- 
groups of phospholipids, consistent with their alignment along the lipid 
bilayer interface (Extended Data Fig. 9d)*°. Furthermore, mutation of 
N72 and N79 to alanine (AmtBN77“/N4), engineered to abolish the 
specific lipid-binding site, results in the loss of PG induced gas-phase 
stability (Fig. 3b), consistent with our IM-MS approach identifying 
specific lipid-binding events. 

By resolving individual lipid-bound states of membrane protein com- 
plexes within the mass spectrometer, and interrogating their stability 
through gas-phase unfolding, we obtained quantitative values to rank 
lipid-binding interactions. MscL binds lipids non-selectively, without 
regard to particular headgroup or chain length, and binding of any lipid 
imparts comparable stability, in accord with the ability of the channel 
to bind and respond promiscuously to lipid composition'’”*. The most 
significant cumulative effects on stability were observed upon binding 
of PI, proposed as the functional lipid in the Mycobacterium membrane 
involved in MscL mechanosensitivity’. Similarly, AqpZ was found to 
be non-selective for a number of lipids suggesting comparable binding 
modes, in line with the two-dimensional crystals of Aqp0'*””. The only 
exception to this non-selective lipid binding is CDL, which significantly 
stabilizes AqpZ and directly modulates function, as suggested in a pre- 
vious in vivo study”. Interestingly, CDL levels in E. coli depend on growth 
conditions, a higher abundance of CDL being present in the stationary 
phase, implying AqpZ function is fine-tuned through modulation of 
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Figure 4 | Crystal structure of AmtB bound to PG. a, AmtB, in cartoon 
representation, is viewed from the periplasm and perpendicular to the 
membrane plane. Resolved PG molecules (spheres) are oriented with 
headgroups on the periplasmic side, in the outer leaflet of the inner membrane. 
b, PG molecules at subunit interface are shown with residue labels and 
hydrophobic residues in contact with lipid tails (stick representation). 
Hydrogen bonds (dashed lines) are formed between the phospho headgroup 
and K84, and a water bridge (aqua sphere). c, Conformational change of 
(yellow arrows) residues 70-81, induced by binding PG. Superposition of 
AmtB structure without lipid (PDB accession code: 1U7G, purple) bound to 
PG (green). 
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CDL levels. In contrast to the other two systems, our finding that AmtB 
responds only weakly to the majority of lipids is consistent with its 
extensive transmembrane structure and inherent stability devoid of the 
lipid bilayer”. However, its unexpectedly high selectivity for PG led to 
the first crystal structure of AmtB in complex with lipid, allowing effects 
on stability to be rationalized and further validated by mutation of res- 
idues forming the specific lipid-binding site. 

By investigating a diverse set of membrane proteins and their lipid- 
binding properties we have extended many seminal studies’ * by dem- 
onstrating different degrees of selectivity in lipid binding with AmtB 
> AqpZ > MscL. Despite these differences in selectivity, in all cases the 
most stabilizing lipids have a direct influence on the structure and func- 
tion of the membrane proteins studied, as proposed for PI binding to 
MscL’ and demonstrated for CDL and PG binding to AqpZ and AmtB, 
respectively. As membrane proteins are intimately embedded in the 
bilayer, the relative lipid abundances, as well as differences in select- 
ivity, can enforce the recruitment of a local lipid environment”, thus 
providing an elegant means for fine-tuning membrane protein struc- 
ture and function. 


METHODS SUMMARY 


Detailed methods are described in Methods. In brief, membrane proteins were 
expressed as TEV protease cleavable fusion proteins containing a terminal His-tag 
in Escherichia coli. Membrane proteins were extracted from purified membranes, 
followed by affinity chromatography, TEV protease cleavage and gel filtration chro- 
matography. Purified membrane complexes and lipids were prepared for IM-MS 
as previously described’*. Lipids were added to solutions containing membrane 
protein complexes and gas-phase unfolding was induced at different collision voltages 
with a step size of 5 V ona modified Waters Synapt G1 HDMS instrument contain- 
ing a linear drift cell. IM-MS data were analysed with software developed in-house. 
AgqpZ functional assays were performed as previously described” using commer- 
cial or in-house E. coli total polar lipid extracts. Crystals of AmtB in the detergent 
CE, were grown in the presence of PG in hanging-drop plates. Single crystals were 
mounted and flash frozen. Diffraction data were collected at the Diamond Light 
Source beam line 104. The structure was solved by molecular replacement, followed 
by automated and manual model building. MD was performed on membrane pro- 
teins in POPC bilayers, and candidate models were filtered using CCS measurements. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 


Received 22 November 2013; accepted 28 April 2014. 


1. Singer, S. J. & Nicolson, G. L. The fluid mosaic model of the structure of cell 
membranes. Science 175, 720-731 (1972). 

2. Cantor, R.S. The influence of membrane lateral pressures on simple geometric 
models of protein conformational equilibria. Chem. Phys. Lipids 101, 45-56 
(1999). 

3. Hunte, C. & Richers, S. Lipids and membrane protein structures. Curr. Opin. Struct. 
Biol. 18, 406-411 (2008). 

4. Lee, A. G. Biological membranes: the importance of molecular detail. Trends 
Biochem. Sci. 36, 493-500 (2011). 

5. Sanders, C. R. & Mittendorf, K. F. Tolerance to changes in membrane lipid 
composition as a selected trait of membrane proteins. Biochemistry 50, 
7858-7867 (2011). 

6. Contreras, F. X., Ernst, A. M., Wieland, F. & Brugger, B. Specificity of intramembrane 
protein-lipid interactions. Cold Spring Harb. Perspect. Biol. 3, http://dx.doi.org/ 
10.1101/cshperspect.a004705 (2011). 

7. Whitelegge, J. P. Integral membrane proteins and bilayer proteomics. Anal. Chem. 
85, 2558-2568 (2013). 

8. Long, S. B., Tao, X., Campbell, E. B. & MacKinnon, R. Atomic structure of a 
voltage-dependent K* channel ina lipid membrane-like environment. Nature 450, 
376-382 (2007). 

9. Zhong, D. & Blount, P. Phosphatidylinositol is crucial for the mechanosensitivity of 
Mycobacterium tuberculosis MscL. Biochemistry 52, 5415-5420 (2013). 


LETTER 


10. Chang, G., Spencer, R.H., Lee, A. T., Barclay, M. T.& Rees, D.C. Structure of the MscL 
homolog from Mycobacterium tuberculosis: a gated mechanosensitive ion channel. 
Science 282, 2220-2226 (1998). 

11. Haswell, E.S., Phillips, R. & Rees, D.C. Mechanosensitive channels: what can they 
do and how do they do it? Structure 19, 1356-1369 (2011). 

12. Savage, D. F., Egea, P. F., Robles-Colmenares, Y, O’Connell, J. D. Ill & Stroud, R. M. 
Architecture and selectivity in aquaporins: 2.5A X-ray structure of aquaporin Z. 
PLoS Biol. 1, e72 (2003). 

13. Gonen, T. et a/. Lipid—-protein interactions in double-layered two-dimensional 
AQPO crystals. Nature 438, 633-638 (2005). 

14. Khademi, S. etal. Mechanism of ammonia transport by Amt/MEP/Rh: structure of 
AmtB at 1.35A. Science 305, 1587-1594 (2004). 

15. Barrera, N. P., Di Bartolo, N., Booth, P. J. & Robinson, C. V. Micelles protect 
membrane complexes from solution to vacuum. Science 321, 243-246 (2008). 

16. Laganowsky, A, Reading, E., Hopper, J. T. & Robinson, C. V. Mass spectrometry of 
intact membrane protein complexes. Nature Protocols 8, 639-651 (2013). 

17. Ruotolo, B. T. et al. Evidence for macromolecular protein rings in the absence of 
bulk water. Science 310, 1658-1661 (2005). 

18. Baldwin, A. J. et a/. The polydispersity of «B-crystallin is rationalized by an 
interconverting polyhedral architecture. Structure 19, 1855-1863 (2011). 

19. Hyung, S. J., Robinson, C. V. & Ruotolo, B. T. Gas-phase unfolding and disassembly 
reveals stability differences in ligand-bound multiprotein complexes. Chem. Biol. 
16, 382-390 (2009). 

20. Pace, C. N. Determination and analysis of urea and guanidine hydrochloride 
denaturation curves. Methods Enzymol. 131, 266-280 (1986). 

21. Hong, H., Joh, N. H., Bowie, J. U. & Tamm, L. K. in Methods Enzymol Vol. 455 (eds 
Michael L. Johnson, Jo M. Holt, & Gary K. Ackers) 213-236 (Academic Press, 
2009). 

22. Powl,A.M., East, J.M.& Lee, A. G. Lipid-protein interactions studied by introduction 
of a tryptophan residue: the mechanosensitive channel MscL. Biochemistry 42, 
14306-14317 (2003). 

23. Borgnia, M. J., Kozono, D., Calamita, G., Maloney, P. C. & Agre, P. Functional 

reconstitution and characterization of AqpZ, the E. coli water channel protein. 

J. Mol. Biol. 291, 1169-1179 (1999). 

24. Tan, B.K. etal. Discovery of a cardiolipin synthase utilizing 

phosphatidylethanolamine and phosphatidylglycerol as substrates. Proc. Nat! 

Acad. Sci. USA 109, 16504-16509 (2012). 

25. Yau, W.M., Wimley, W. C., Gawrisch, K. & White, S. H. The preference of tryptophan 

for membrane interfaces. Biochemistry 37, 14713-14718 (1998). 

26. Iscla, 1. & Blount, P. Sensing and responding to membrane tension: the bacterial 

scL channel as a model system. Biophys. J. 103, 169-174 (2012). 

27. Hite,R.K., Li, Z.& Walz, T. Principles of membrane protein interactions with annular 

ipids deduced from aquaporin-0 2D crystals. EMBO J. 29, 1652-1658 (2010). 

28. Romantsoyv, T., Battle, A. R., Hendel, J. L., Martinac, B. & Wood, J. M. Protein 

ocalization in Escherichia coli cells: comparison of the cytoplasmic membrane 

proteins ProP, LacY, ProW, AqpZ, MscS, and MscL. J. Bacteriol. 192, 912-924 
(2010). 

29. Blakey, D. et al. Purification of the Escherichia coli ammonium transporter AmtB 
reveals a trimeric stoichiometry. Biochem. J. 364, 527-535 (2002). 

30. Engelman, D. M. Membranes are more mosaic than fluid. Nature 438, 578-580 
(2005). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank J. Hobman and D. Lee for providing gene-doctoring 
plasmids, Z. Guan and C. Li for providing E. coli strains and plasmids containing 
cardiolipin genes. We also thank D. Staunton and N. Housden for training on the 
stopped-flow apparatus; and T. Mize, J. Benesch, M. McDonough, T. Walton and D. Rees 
for discussions. We also gratefully acknowledge E. Lowe and S. Lea for organizing 
synchrotron proposals and Diamond Light Source beam line 104 and staff, the Medical 
Research Council (MRC), BBSRC and ERC advanced grant (IMPRESS) for funding. A.L. 
is a Nicholas Kurti Junior Research Fellow of Brasenose College, AJ.B. isa BBSRC David 
Phillip’s Fellow and C.V.R. is a Royal Society Professor. 


Author Contributions A.L., E.R., and C.V.R. designed the research. A.L. and E.R. 
performed the experiments. T.M.A. assisted A.L. and E.R. in protein expression and 
purification. M.B.U. carried out molecular dynamics. M.T.D., AJ.B. and A.L. designed 
and performed post-molecular dynamics analyses. A.L, E.R., T.M.A. and AJ.B. 
developed IM-MS analysis software. A.L. and E.R. analysed the data. A.L,E.R.andC.V.R. 
wrote the paper with input from the other authors. 


Author Information Atomic coordinates and structure factors for the crystal structure 
have been deposited with the Protein Data Bank (PDB) under accession code 4NH2. 
Reprints and permissions information is available at www.nature.com/reprints. 

The authors declare no competing financial interests. Readers are welcome to 
comment on the online version of the paper. Correspondence and requests for 
materials should be addressed to ALL. (art.laganowsky@chem.ox.ac.uk) or 

C\.R. (carol.robinson@chem.ox.ac.uk). 


5 JUNE 2014 | VOL 510 | NATURE | 175 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


METHODS 


Plasmid construction. Two expression plasmids were constructed by subcloning 
the multiple cloning site region from Xbal (New England Biolabs) and BlpI (New 
England Biolabs) of pET23b (Novagen) into the backbone pET15b (Novagen). 
The resulting engineered vector was linearized by Ndel (New England Biolabs) and 
Xhol (New England Biolabs), gel purified (QIA quick Gel Extraction Kit, Qiagen), 
and used in subsequent Infusion cloning reactions (Clonetech) to generate a TEV 
protease cleavable C-terminal fusion to superfolder GFP (subcloned from Gandhi 
et al.*') followed by a 6X His-tag or a TEV protease cleavable N-terminal fusion to 
maltose binding protein (MBP) preceded by a secretion signal peptide (pelB) and 
10X His-tag subcloned from Hilf et al.**. The resulting N- and C-terminal fusion 
vectors were linearized with Nhel and Xhol or NdeI and Nhel, gel purified, and 
used in subsequent Infusion cloning reactions. AmtB (residues 26-428), AqpZ and 
MscL genes were amplified by polymerase chain reaction (PCR) with Phusion high- 
fidelity DNA polymerase (New England Biolabs) from prepared E. coli BL21 (DE3) 
genomic DNA (Qiagen) or template plasmid DNA with primers designed for an 
Infusion cloning reaction using the manufacturer’s online tool. The PCR products 
were purified by agarose gel electrophoresis and extracted using the QIAquick gel 
extraction kit (Qiagen). The PCR products and linearized vectors (described above) 
were used in Infusion cloning reactions (Clonetech) to generate MBP-AmtB, AqpZ- 
GFP and Mscl-GEP. MBP—AmtBN7?4/N”°4 was achieved by two-rounds of site- 
directed mutagenesis using a QuikChange Lightning Multi Site-Directed Mutagenesis 
Kit (Stratagene, La Jolla, CA) according to the manufacturer’s protocol. AqpZ har- 
bouring a TEV protease cleavable N-terminal 6 His-tag (NHis—-AgpZ) was con- 
structed through an infusion cloning reaction of AqpZ and a custom pRSFDuet-1 
vector (Novagen), the N-terminal coding sequence between Ncol and BamHI was 
replaced with the following nucleotide sequence: CCATGGGCAGCAGCCATCA 
CCATCATCACCACGAGAACCTGTACTTCCAGGGTGGATCC, linearized with 
BamHI and Xhol. All constructs were verified by DNA sequencing. 

Generation of E. coliBL21(DE3) AmscL::Kan* strain. Homologous recombina- 
tion mediated chromosomal disruption of mscL was performed using the Gene 
Doctoring method”. Briefly, 900 base pairs flanking the 5’ and 3’ ends of the mscL 
gene were subcloned into the pDOC-K plasmid resulting in these regions flanking 
a kanamycin resistance (Kan*®) cassette. This plasmid and the recombineering 
plasmid, pACBSCE, were transformed into E. coli BL21 (DE3) Gold (Agilent). 
Homologous recombination was carried out by induction of A-Red recombinase 
system and Scel endonuclease with arabinose. Recombinants harbouring gene dis- 
ruption with Kan® were selected on kanamycin and sucrose, and verified by colony 
PCR. Calcium chloride competent cells were made as described by Drew et al.**. 
Membrane protein expression. MBP-AmtB and AqpZ-GFP plasmids were trans- 
formed into E. coli BL21 (DE3) Gold (Agilent). MBP-AmtBN724/N794 and NHis- 
AqpZ were transformed into E. coli OverExpress C43 (DE3) (Lucigen). MscL-GFP 
was transformed into E. coli BL21(DE3) AmscL::Kan® (described above). Several 
colonies were inoculated into 50 ml LB Miller (5 g yeast extract, 10 g peptone from 
casein, and 10 g sodium chloride per litre) and grown overnight at 37 °C. One litre 
of LB in 2 litre shaker flasks was inoculated with 7 ml of overnight culture and grown 
at 37 °C until the culture reached OD ¢oonm (OD¢o0) between 0.6 to 0.8. Isopropyl B-p- 
1-thiogalactopyranoside (IPTG) was added to the culture at a final concentration of 
0.5 mM and grown for 3 hat 37 °C. Cells were collected by centrifugation at 5,000g 
for 10 min at 4 °C. Cell pellets were resuspended in phosphate-buffered saline, pel- 
leted by centrifugation at 5,000g for 10 min at 4 °C and stored at —80°C. 
Purification of membranes. Membranes were purified as described by Newby 
etal.*° with minor modifications. Cell pellets were thawed and resuspended at 20 ml 
per litre of culture in Buffer A (300 mM sodium chloride, and 20 mM 2-amino-2- 
hydroxymethy]-propane-1,3-diol (Tris), pH 7.4 at room temperature) supplemented 
with a complete protease inhibitor tablet (Roche) and 5 mM beta-mercaptoethanol 
(BME). The cell suspension was passed several times through an M-110 PS micro- 
fluidizer (Microfluidics) at 19,000 psi. Insoluble material was pelleted by centrifu- 
gation at 20,000g for 25 min at 4 °C. Membranes were pelleted by centrifugation at 
100,000g for 2 h at 4 °C. Membranes were resuspended in ice-cold buffer B (100 mM 
sodium chloride, 20% glycerol, 5 mM BME and 20 mM Tris, pH 7.4 at room tem- 
perature), homogenized using a Potter-Elvehjem Teflon pestle and glass tube. For 
MscL-GEP, glycerol was omitted from Buffer B. Resuspended membranes were 
used either directly or flash frozen in liquid nitrogen and stored at — 80°C. 
AmtB-GFP detergent screen. A membrane protein detergent screen on AmtB- 
GFP was performed as previously described'® with minor modifications. Briefly, 
AmtB-GFP was extracted from purified membranes (described above) with 1-2% 
(w/v) of the detergent of interest in buffer B and incubated for one to three hours at 
room temperature or overnight at 4 °C with gentle agitation. Insoluble material was 
pelleted by centrifugation at 20,000g for 25 min at 4 °C. The clarified supernatant 
was loaded onto small Ni-NTA agarose (Qiagen) drip columns (Bio-spin Chroma- 
tography columns, Bio-Rad) and washed with several column volumes of buffer C 
(200 mM sodium chloride, 20 mM imidazole, 5 mM BME, 50 mM Tris, pH 8.0 at 


room temperature) supplemented with two times the critical micelle concentration 
(CMC) of the detergent of interest. AmtB-GFP was eluted with two column volumes 
of buffer D (10 mM sodium chloride, 250 mM imidazole, 5mM BME, 2X CMC 
detergent of interest, and 50 mM Tris, pH 8.0 at room temperature). Eluted protein 
was concentrated using a 100 kDa Molecular Weight Cutoff (MWCO) concentrator 
and buffer exchanged into MS Buffer (2x CMC detergent of interest and 200 mM 
ammonium acetate, pH 8.0 with ammonium hydroxide) using a centrifugal buffer 
exchange device (Micro Bio-Spin 6, Bio-Rad). 
Purification of membrane proteins. MBP-AmtB, MBP-AmtBN”“/N”4 and 
AqpZ-GFP were extracted from purified membranes in buffer B supplemented 
with 200 mM OG and incubated with gentle agitation overnight at 4 °C. MscL was 
extracted from purified membranes with glycerol-free buffer B supplemented with 
1% OGNG. Extracted membrane proteins were clarified by centrifugation at 20,000g 
for 25 min at 4 °C. Supernatant was filtered before loading onto a 5 ml HisTrap-HP 
column (GE Healthcare, Piscataway, NJ) equilibrated in buffer E (200 mM sodium 
chloride, 10% glycerol, 20 mM imidazole, 0.025% DDM and 50 mM Tris, pH 7.4 
at room temperature). After the clarified supernatant was loaded, the column was 
initially washed with 40-50 ml of DDM-free buffer E supplemented with 1% OG 
for MBP-AmtB and AqpZ-GFP and 0.5% OGNG for MscL-GFP. Membrane pro- 
teins were then exchanged into several column volumes of buffer E until a steady 
baseline was reached. Membrane protein fusions were eluted with a linear gradient 
to 100% in two column volumes of buffer F (100 mM sodium chloride, 10% glycerol, 
500 mM imidazole, 0.025% DDM and 50 mM Tris, pH 7.4 at room temperature). 
Peak fractions were pooled, supplemented with 5 mM BME and His-tagged TEV 
protease**””, and dialysed against buffer G (150 mM sodium chloride, 10% glycerol, 
20 mM imidazole, 0.025% DDM and 50mM Tris, pH7.4 at room temperature) 
overnight at 4 °C. After overnight incubation, samples were filtered and passed back 
over a 5 ml HisTrap-HP column equilibrated in buffer E. Flow-through containing 
the untagged membrane protein was collected and concentrated using a 100 kDa 
MWCO concentrator. Notably, tag-removed membrane proteins with fusions to 
the C terminus contained the additional protein sequence ASGENLYFQ, or GAS 
for N-terminally tagged proteins, resulting from the TEV protease recognition 
sequence and cloning restriction site. Concentrated protein was either used imme- 
diately or flash-frozen in liquid nitrogen and stored at — 80 °C. Protein concentra- 
tion was measured using Biomate UV detector with the following calculated extinction 
coefficients* of 1.57, 1.484, and 0.265 M~!cm7! for AmtB, AgpZand MscL, respectively. 
NHis-AqpZ was purified as previously described”. Briefly, extracted and clari- 
fied membrane proteins were loaded onto a 5 ml HisTrap-HP column (GE Health- 
care, Piscataway, NJ) equilibrated in buffer (200 mM sodium chloride, 10% glycerol, 
20 mM imidazole, 1% OG and 50 mM Tris, pH 7.4 at room temperature). NHis— 
AgpZ was eluted with a linear gradient to 100% elution buffer (100 mM sodium 
chloride, 10% glycerol, 500 mM imidazole, 1% OG and 50 mM Tris, pH 7.4 at room 
temperature) over two column volumes. Peak fractions containing NHis-AqpZ 
were concentrated before imidazole removal using two 5 ml HiTrap desalting col- 
umns (GE Healthcare, Piscataway, NJ) in tandem. Peak fractions were pooled and 
NHis-AgqpZ was concentrated using a 50 kDa MWCO concentrator. 
Preparation of membrane proteins for native mass spectrometry. Either flash- 
frozen samples thawed on ice or fresh samples were detergent exchanged by gel fil- 
tration chromatography. Membrane protein samples were injected onto a Superdex 
200 GL 10/300 (GE Healthcare) column equilibrated in buffer H (130 mM sodium 
chloride, 10% glycerol and 50 mM Tris, pH 7.4 at room temperature) supplemen- 
ted with either 0.5% of CgE4, 0.4% of NG, 0.116% OGNG or 0.025% of DDM. Peak 
fractions containing the detergent-exchanged membrane protein complex were 
concentrated. A 50 kDa MWCO concentrator was used to concentrate samples in 
the detergent CgE, and 100 kDa MWCO concentrator used on membrane protein 
complexes in all other detergents. Concentrated proteins were either used directly 
or flash-frozen in liquid nitrogen and stored at — 80 °C. Notably, we found no obser- 
vable difference in mass spectra quality after a single freeze-thaw of membrane 
proteins throughout our purification regime. Purified membrane proteins were 
buffer exchanged into MS Buffer (two times the CMC of detergent of interest and 
200 mM ammonium acetate, pH 7.2-8.0 with ammonium hydroxide) using a cen- 
trifugal buffer exchange device (Micro Bio-Spin 6, Bio-Rad) as previously described’*. 
Membrane proteins detergent exchanged into MS buffer supplemented with CsE4 
or OGNG could be flash frozen without compromising mass spectra quality unlike 
MS Buffer supplemented with NG or DDM. 
Preparation and titration of phospholipids. Phospholipids were purchased 
from Avanti (Avanti Polar Lipids Inc., Alabama, USA) and prepared at stock concen- 
trations of 10 mg ml! in 200 mM ammonium acetate pH 8.0 as previously described’. 
The membrane protein complex to phospholipid to detergent ratio (P:L:D) was opti- 
mized for each membrane protein complex to achieve consistency of nanoelec- 
trospray and resolved mass spectral peaks of bound phospholipid throughout 
the gas-phase unfolding series (Extended Data Fig. 2b). Phospholipid prepara- 
tions were added to buffer exchanged membrane protein complexes followed by 
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equilibration at room temperature for 10 to 30 min before mass spectrometry 
analysis. Notably, by mass spectrometry we did not observe a significant increase 
in bound phospholipid with longer incubation times. 
Mass spectrometry. Mass spectrometer settings were initially set to values for mem- 
brane proteins as previously described’* for a modified Q-TOF 2 mass spectrometer 
(Micromass, Manchester, UK) with a Z-spray source’. Typical instrument values 
were 5-7 bar for source pressure, 1.5-1.8 kV for capillary voltage, 150-190 V for 
cone voltage, 1-10 V for extraction voltage, 180-200 V for collision voltage, argon 
for collisional gas and 0.2-0.3 MPa argon gas pressure. 
Ion mobility mass spectrometry. Ion mobility measurements were performed on 
a Synapt G2 (where noted) or on a modified Synapt G1 HDMS instrument with 
the travelling-wave ion mobility cell replaced by an 18-cm drift cell with radial RF 
ion confinement and a linear voltage gradient to direct ions along the axis of trans- 
mission to the time-of-flight (TOF) mass analyser*’. The ion mobility mass spectro- 
meter was typically set to a source pressure of 5-7 mBar, capillary voltage of 1.4-1.7 kV, 
capillary nanoflow of 0.03-0.2 mBar and argon as collision gas with flow rate set 
to 5-8 ml min”! (~6.6-6.7 e * mBar). Collision voltage ranged from 50 to 240 V 
with measurements taken at 5 V steps for monitoring gas-phase unfolding of mem- 
brane protein complexes. The sample and extraction cone, and trap bias voltages, 
quadrupole profile, and collision (trap) gas pressure were optimized for maximal 
ion intensity of the target membrane protein complex. Helium was the drift cell gas 
and was set to a pressure of ~1.6-1.8 Torr (40 ml min flow rate). Pusher time, 
pulse width and TDC inhibit were set to 180, 7 and 7 1s, respectively. The ion guide/ 
source (300 ms ', 10 V), trap (300 ms !,0.2 V) and transfer (100 ms_', 10 V) trav- 
elling wave velocities and wave heights were kept constant. The mobility release 
time was set at 200 1s with a trap voltage of 30 V and extract voltage of 10 V. Pres- 
sure and temperature of the ion mobility cell was measured directly. 

Ion drift time (tg) was determined as previously described’. Briefly, tg was 
determined by subtracting the non-drift time component (f,) from the ion arrival 
time (t,) 


ta =t, —to 


t. was determined by extrapolation from linear plots of t, versus V_' over ten 
different voltage potentials (Extended Data Fig. 1d). Ion drift time was converted 
to a rotationally averaged ion-neutral collision cross section (CCS) (Q) using the 
Mason-Schamp equation” 
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here z is the charge on the ion, jz the reduced mass of the ion-neutral in the drift 
chamber, N is the drift gas number density, k, is the Boltzmann constant, e is the 
charge on the electron, V is the potential voltage across the drift cell, p is the pres- 
sure of the drift cell, T is the thermodynamic temperature of the drift cell, L is the 
length of the drift tube, T, is standard temperature and p, is standard pressure. 
Travelling wave ion mobility measurements (TW-IMS) were performed on a 
commercially available Synapt G2 (Waters). Source pressure set to 5-7 mBar, cap- 
illary voltage of 1.4-1.7 kV, capillary nanoflow of 0.03-0.2 mBar and argon as col- 
lision (trap) gas at flow rate of 8.0 ml min’ *. Collision voltage ranged from 50 to 
200 V with measurements taken at 5 V step intervals for monitoring gas-phase 
unfolding of membrane complexes. The sample and extraction cone and trap bias 
voltages, quadrupole profile, and collision gas pressure were optimized for maxi- 
mal ion intensity of the target membrane protein complex. Helium was the IMS 
entrance cell gas set to a flow rate of 180 ml min‘ and the drift cell gas was nitro- 
gen set to flow rate of 90 ml min” '. The EDC delay coefficient was 1.57 ps. The trap 
wave velocity and wave height were 300 ms ' and 8.0V, respectively. The IMS 
wave velocity and wave height were 300 ms__' and 30.0 V, respectively. The transfer 
wave velocity and wave height were 100 ms__' and 2.0 V, respectively. The mobility 
release time was set at 200 1s with a trap voltage of 30 V and extract voltage of 10 V. 
Gas-phase unfolding profiles are largely similar to those collected on a drift cell 
instrument (Extended Data Fig. 6d). 
Collision cross-section calculations. The projection approximation (PA) cal- 
culation within a modified version of MOBCAL'*** used to calculate the Op, 
from respective crystal structure coordinates. CCS values were corrected Qcor by 
the correction factor of 1.14 and missing residues between the protein construct 
(Mgxp) and crystal structure (Mppp)**”° 


a 

Mixp \3 

eg = 1-14 =| 
Mppg 


This correction has been shown to give values that correlate with known values 
to + 3%”, 

Gas-phase unfolding data analysis and modelling. Mass spectra and ion mobil- 
ity data were analysed using in-house software written in the Python programming 
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language with a graphical user interface constructed using wxPython (manuscript 
in preparation). Briefly, MS-only data collected on a Q-TOF 2 instrument was smoothed 
with MassLynx software before being imported and linearized at a step size of one 
in m/z. Theoretical mass spectra were predicted following equations described in 
Stengel et al.**. Mass, average charge state, width of charge state distribution, reso- 
lution, and concentration were fitted to experimental data with least squares regres- 
sion of the pseudo-y” function“. 

Ion mobility data were imported using a custom script provided by Waters at an 
m/z resolution of four in drift data and one in mass spectra for fitting. Mass spectra 
were smoothed and theoretical mass spectra were fitted as described above. A 
standardized integration window was employed for extracting arrival time distri- 
butions (ATD) using a resolution (R = m/z + Am/z) of 1,000 and a height cut-off 
of 25% for the modelled Gaussian to the respective m/z peak. To generate gas-phase 
unfolding plots, ATDs were extracted for the given m/z peak using the standardized 
integration window followed by normalization. To convert to a CCS axis, arrival 
time was directly converted to CCS using equations above. This procedure gener- 
ates gas-phase unfolding data and are plotted using Matplotlib”. All ion mobility 
mass spectra are shown in linear scale. 

To a first order approximation, gas-phase unfolding of protein complexes, like 
unfolding in solution, will be linearly dependent (m) on collision voltage (CV) 


AG=AG? — m[CV] 


where AG is free energy of unfolding and AG? is the free energy of the system at 
zero collision voltage in the gas-phase. Using this assumption, two, three, four and 
five state linear equilibrium unfolding models were used to model the abundance 
of native and non-native species. Notably, two new free parameters are introduced 
for each new species. Two, three, four and five state unfolding models were fitted to 
the data, and an F-test was used to decide if the fitting quality is sufficiently improved 
as to justify the inclusion of additional fitting parameters. The most abundant charge 
state of the three membrane protein complexes agreed with a four state unfolding 
model. For example, in the case of three unfolding species we used the following 
thermodynamic denaturing unfolding model’. 


KK 
NSLS hb 
Expansion of AGy., gives the following, 


AGyat, = AGL, — My=1, [CV] 


where Fis a mole fraction. The sum of mole fractions of native (Fy) and non-native 
species (Fj, and Fj, ) is equal to 1 


Fy + Fy, +Fy, =1. 


We can generalize this taking into account the entire relationships one can obtain 
the fraction of native and non-native given K, and K, 
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The respective AG’ and m parameters were solved numerically. In the two species 
case one can obtain a closed form solution, however, it is both efficient and general 
to solve them numerically. 

An irreversible kinetic unfolding model as an alternative for modelling gas- 
phase unfolding data was also explored. The irreversible unfolding model defines 
the degree of unfolding as being related to time. By changing the potential across 
the drift cell, the time ions spend in the ion mobility drift cell post collision-induced 
unfolding can be altered over several orders of magnitude. The relative abundances 
of unfolded intermediates species as a function of drift time was monitored for a 
membrane protein complex. There was no considerable change in relative abun- 
dances over a fourfold increase in time (Extended Data Fig. 6e, f) demonstrating 
that the unfolding mechanism does not agree with an irreversible kinetic model for 
the unfolding model and no further unfolding occurs post collision cell. Data are in 
excellent agreement however with a reversible model as described above. 
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Ion mobility intensities were extracted for native and unfolded CCS species at 
various collision voltages. With the assumption that signal intensity is approxi- 
mately proportional to species concentration, the data can be converted to a molar 
fraction to generate two dimensional (2D) data, which can be fitted to an equilibrium 
unfolding model. Minimized 2D parameters were used as a seed for three dimen- 
sional (3D) fitting of gas-phase unfolding data. 

From the unfolding curve, and parameters m, AG’, CCS,, (mean of CCS spe- 
cies), CCS, (standard deviation of CCS species), the 3D data can be fit. First, a CCS 
range was selected for each native and unfolded species in the unfolding plot. These 
CCS ranges provided the initial parameters used to model various CCS species 
identified by inspection. Next, to make the calculations more efficient, the gas-phase 
unfolding data were trimmed to a minimum and maximum value derived by either 
subtraction or addition of 10% times the native and last unfolding CCS species, 
respectively. Trimming resulted in gas-phase unfolding containing on average around 
900 data points. The equilibrium unfolding model described above was seeded 
with the parameters from the minimized 2D data and the molar fraction of species 
was normalized to match the normalization of experimental data. The fitting sur- 
face was rugged and required a sophisticated minimization procedure to maximize 
the chance of finding global minimum. First the 3D model was minimized using 
quasi-Newton method of Broyden, Fletcher, Goldfarb, and Shanno (BFGS) avail- 
able in Scipy”®. This provided the boundaries for bio-inspired algorithms; all para- 
meters were set to boundaries of + 25% with the exception of the CCS species peak 
centres which were set to + 5%. The 3D model was minimized using the modified 
differential algorithm (de_1220) available in PYGMO*!” using the following para- 
meters: population of 20; 12 evolutions; 250 generations; and 8 islands. Other 
algorithms, such as particle swarm and bee colony were screened however the 
modified differential algorithm produced similar or higher R? values in consid- 
erably less computation time. 

The resulting minimized 3D model enabled quantification of unfolding transi- 
tions and stabilization. An unfolding transition occurs when a species with a deter- 
mined CCS transitions to another CCS species, for example see Extended Data Fig. 6c. 
The transition midpoint, CV» is determined by: 


AG° 
CV50,3 = — 
m 


where CV5s9, is the value at which 50% of a specific transition state i is depleted. This 
metric was found to be particularly useful for this application as it removes correla- 
tions between AG® and m inherent in the fitting procedure. The stabilization of a 
protein due to the presence of additional phospholipids was quantified according to: 


ACV, = CVs — CV 


This comparative approach takes into account to first order any variability in trans- 
ition collision voltages between repeated measurements for a given protein. 

To compare between various proteins and different unfolding states, as well as 
to average out any systematic and/or inherent variability in droplet formation, the 
average stabilization was calculated for the bound ligand by 


Zz SS ACV50,i 
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Stabilization = 


where ¢ is the number of transitions, i is a specific transition, and z is the charge 
state. Averages and standard error of the means was calculated for three repeated 
measurements. Statistical significance between means were determined by a one- 
way ANOVA test (1 = 3) followed by post-hoc Tukey analysis. A Shapiro—Wilk test 
(at the 0.05 level) was used to infer the data was drawn from a normally distributed 
population. A Brown-Forsythe test (at the 0.05 level) was performed to infer homo- 
geneity of variance before means comparison. All statistical tests were performed 
using OriginPro 8.5.1. 

Molecular dynamics preparation and simulation. Crystal structures for the 
Ammonia Channel AmtB (1U7G)"*, Aquaporin Z (1RC2)!* and MscL (2OAR)'° 
were obtained from the Protein Data Bank and prepared. Molecular simulations 
were performed using GROMACS 4.5 (http://www.gromacs.org)”*, with the OPLS 
all-atom protein force field’’, OPLS united-atom lipid parameters for palmitoyloleoyl- 
phosphocholine (POPC)* and the TIP3P water model”. Initial configurations were 
created using hippo beta (http://www.biowerkzeug.com). 

Electrostatic interactions were computed using the particle mesh Ewald (PME) 
method®, and a cut-off of 10 A was used for van der Waals interactions. Bonds 
involving hydrogen atoms were restrained using LINCS”. Simulations were run 
with a 2 fs integration time step and neighbour lists were updated every five steps. 
All simulations are performed in an NPT ensemble, without additional applied 
surface tension. Solvent (water, ions), lipids, and the protein were each coupled 
separately to a heat bath at a temperature of 30 °C with time constant ty = 0.1 ps 
using weak temperature coupling”. Atmospheric pressure of 1 bar was maintained 


using weak semi-isotropic pressure coupling with compressibility k, = K,, = 4.6 
x 10° bar”! and time constant tp = 1 ps®. A complete list of simulations is given 
in Extended Data Fig. 4b. 

Each protein phospholipid pair generated by molecular dynamics (MD) was treated 

asa candidate model. For example, ~206,000 such models were generated for AqpZ 
and filtered by comparison of calculated and experimental CCS values. More spe- 
cifically, within each MD snapshot there are ~80 lipids that could be in accord with 
the CCS measurements for one lipid bound to AqpZ. After filtering by agreement 
with CCS, an ensemble of protein phospholipid complexes consistent with our data 
was identified. As the MD is used to provide a source of potential lipid associated 
structures, the results are largely invariant to the choice of force field and lipid 
molecule. 
Identifying lipid binding sites from MD simulations and CCS measurements. 
The object is to determine which MD frames are in best agreement with the exper- 
imental CCS data. For every simulated system, one frame was extracted every 0.2 ns 
generating 1,000 PDB files for a 500 ns simulation time. For each extracted frame, 
all lipids within 6 A of the protein were identified. All possible combinations of 
these two lipids and protein were selected. For example for AqpZ and one lipid 
typically yielded 80 extracted combinations per frame (Extended Data Fig. 4c). The 
CCS value of each combination per frame (f) was calculated using a modified 
version of MOBCAL'***, A ratio (Rycaic) was determined using the calculated 
CCS values for each of the +1 lipid and the +2 lipid-bound states 


CCSprotein +lipid,f.l 


Ry calc = 
tam CCSprotein,f 


where CCS,,otein+lipid is CCS of protein plus number of lipid(s) (J) and CCS, rotein is 
CCS of protein. Using a ratio avoided any potential issues associated with taking 
the difference of uncorrected CCS values determined experimentally. These mea- 
surements enable the calculation of three chi-squared (7) values for a given 
combination of two lipids and protein 


Xp a 
where Ryexp and o are the average and standard deviation of the experimental CCS 
ratio for data with one or two lipid molecules bound from three repeated measure- 
ments, and Reexp is the CCS ratio derived from theoretical calculations. The overall 
chi-squared values were calculated from: 


= Gat Grat Gv 


where 73, is the chi-squared calculated for two lipids bound, and 77 ,, and y7, are 


the chi-squared values for each individual lipid molecule with the protein. In the 
case where only the +1 lipid CCS data are analysed, neglecting the +2 lipid data, 
only one x’ value contributed towards the sum. The probability (Pp) that a given 
protein phospholipid pair with protein matches the data are given by the following 
Bayesian probability: 


=i) 
Proce 2 


where 72,;, is the minimum chi-squared value obtained across the entire simu- 
lation, whose corresponding P, is equal to 1, representing the most probable 
structure. It follows that all candidate lipid pair and protein combinations with a 
Prvalue greater than 0.785 will be one half standard deviation from the most prob- 
able structure. The ensemble of structures identified with Py > = 0.785 can be 
considered the most probable arrangement of lipids and protein found to be con- 
sistent with the experimental data. This ensemble of structures was projected onto 
the surface of the protein to identify the most probable locations of contact between 
the protein and the lipid. 

Total polar lipid extracts from cardiolipin-deficient E. coli strain, BKT22. The 
cardiolipin-deficient E. coli strain**, BKT22 (AclsA, AclsB, AclsC, A lymdB::Kan*) 
and calcium chloride competent BKT22 cells were transformed with pBAD-YC 
(plasmid containing clsC and ymdB genes”), referred herein as BKT22-YC, and 
grown overnight in LB media. Overnight cultures were diluted to an OD¢o9 of 0.03 
in LB media with the exception that arabinose was added toa final concentration of 
0.2% for BKT22-YC to induce expression of cardiolipin genes, clsC and ymdB. The 
strains were grown for six hours to stationary phase (OD¢o0 ~2.0). Cells were col- 
lected by centrifugation at 3,500g at 4 °C for 10 min. Cell pellets were resuspended in 
phosphate-buffered saline followed by centrifugation. 

Lipids were extracted using the Folch method". Briefly, cells were resuspended 
in 20 ml of 2:1 (v/v) chloroform:methanol per gram of cells and agitated on a stir 
plate for 20 min. The mixture was then clarified using filter paper and washed with 
0.2 volume of water. The extract was briefly vortexed before centrifugation at 800g 
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to separate phases. The upper aqueous phase was removed and the interface was 
washed with 1:1 (v/v) methanol:water without disturbing the lower organic phase. 
The lower organic phase was recovered and evaporated under a stream of nitrogen. 
The dried total lipid extract was then resuspended in diethyl ether to extract polar 
lipids and evaporated under nitrogen then under vacuum overnight. 

Thin-layer chromatography. HPTLC Silica gel 60 aluminium plates (Merck Millipore) 
were pre-developed with 1:1 (v/v) chloroform:methanol and left to air-dry overnight. 
The plates were then impregnated with an adsorbent modifier (1:1 (v/v) 1.2% boric 
acid in absolute ethanol-water) followed by activation at 100°C for one hour*’. 
Samples were spotted and plates developed in 65:25:5 (v/v) chloroform:methanol: 
acetic acid. After air-drying for 30 min, plates were sprayed with 10% sulphuric 
acid in ethanol and charred at 200 °C (ref. 63). Molar per cent (mol %) of lipid species 
was determined from the intensity of lipid species using ImageJ 1.47v software“. 
Proteoliposome/liposome preparation. E. coli total polar lipid extract (acetone/ 
ether preparation; Avanti Polar Lipids) was dissolved at 20 mg ml! in 2:1 (v/v) chlo- 
roform:methanol to obtain a clear solution. Solvent was evaporated under a stream 
of nitrogen gas then under vacuum overnight. E. coli total polar lipid (EPL), BKT22 
and BKT22-YC extracts were hydrated in 2mM BME to a final concentration of 
50mg ml | and left to incubate for 1 h at room temperature with constant agita- 
tion. These were then split into aliquots and frozen. Lipids were always handled 
under a nitrogen atmosphere. 

Liposomes and proteoliposomes were prepared following previously established 

protocols”*””. Briefly, lipid stocks were diluted into borosilicate tubes to a final con- 
centration of 45 mg ml‘ in 100 mM MOPS-Na pH 7.5 and pulsed in a bath soni- 
cator until a clear suspension was obtained. A reconstitution mixture (400 11) was 
prepared in a borosilicate tube at room temperature by sequentially adding 100 mM 
Mops-Na (pH 7.5), 1.25% (wt/vol) OG, purified NHis-AqpZ for proteoliposomes 
(final concentration 100 jig ml” '), and 10 mg ml sonicated lipids. After incuba- 
tion for 1 h at room temperature, the mixture was diluted 25-fold into 20 mM HEPES 
pH7.5. The liposomes were collected by centrifugation 140,000g for one hour and 
resuspended to a final volume of 1.5 ml in 20 mM HEPES pH 7.5. Liposomes and 
proteoliposomes were stored on ice before measurement. EPL, BKT22 and BKT22- 
YC liposomes/proteoliposomes had an average diameter of 140, 149 and 160 nm 
determined from dynamic light scattering (Viscotek 802). 
AqpZ water permeability assay. Liposome shrinkage was induced by rapidly mix- 
ing 1:1 (v/v) proteoliposomes/liposomes with osmolyte buffer (20 mM HEPES pH 7.5, 
570 mM sucrose; a 285 milliosmolarity gradient) using a stopped-flow apparatus 
(Applied Photophysics SX20). Water transport/permeability was monitored by 
measuring the light scattering (A = 600 nm) of the preparation upon mixing at 
8 °C. An increase in signal reflects liposome shrinkage. Light scattering (L) data 
from repeated measurements (n = 5) were fit to a single exponential rise equation 
to obtain the rate of water transport (kya) using OriginPro 8.5.1. 


L(t) =Ae~*" +L, 


where ¢ is time in msec, A is a constant, kya, is rate of water transport, and L, is the 
maximum light scattering (typically one for normalized data). 
AmtB crystallization and structure determination. AmtB was purified as des- 
cribed for native mass spectrometry experiments with one minor modification. AmtB 
was detergent exchanged into buffer H containing 0.5% CE, using a Superdex 200 
10/300 gel filtration column (GE Healthcare). Peak fractions of AmtB from gel 
filtration were pooled and concentrated to ~15 mg ml * using a 50 kDa MWCO 
concentrator (Millipore). A tenfold molar excess of phosphotidylglycerol was added 
to AmtB before crystallization. Crystals of AmtB were grown at 20°C in hanging 
drop plates with crystallization solution 15% PEG 4000, 0.8 M potassium formate, 
and 0.1 M sodium acetate pH 4.6. Hexagonal plate crystals appeared after one month. 
Single crystals were mounted with CrystalCap HT Cryoloops (Hampton Research, 
Aliso Viejo, CA) before flash frozen. Data were collected at Diamond Light Source 
beam line 104. Diffraction data collected at a wavelength of 0.97949 A were pro- 
cessed with XDS® in apparent space group P622. Cell content analysis suggested 
two molecules in the asymmetric cell and attempts to obtain phases from molecu- 
lar replacement using PHASER® with PDB: 1U7G resulted in placement of only 
one molecule with a missing layer of protein molecules. Twinning was suspected 
and the data were processed into lower symmetry space groups until molecular 
replacement could correctly place all protein molecules in the asymmetric cell, which 
turned out to be space group C222,. Data were submitted to the UCLA twin detection 
server” for twinning analysis that indicated partial twinning with <|L|> = 0.463 
(untwinned = 0.5, perfectly twinned = 0.375) and <L2> = 0.291 (untwinned = 0.333, 
perfectly twinned = 0.200). Despite the presence of twinning, initial phases were 
found by molecular replacement using PHASER with PDB 1U7G placing six 
molecules in the asymmetric unit cell, and followed automated model building 
using PHENIX® and manual model building using COOT®. All model refine- 
ment was done with REFMAC” in twin mode. Omit maps were generated first by 
deleting coordinates of the ligand before refinement. Refinement with twinning 
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applied resulted in a 10% drop in both Rwork and Reree, consistent with twinning. 
Ramachandran plot of the refined structure gave 96% in favoured regions and 
0.4% outliers. Figures were generated with PYMOL”. 
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$000 4000 5000 6000 7000 8000 9000 10000 11000 
m/z 
c 
Protein Q_ (A?) PDB 2 corr (A?) Relative error (%) 
[Charge state] (Resolution, A) Plas, BO EY ccs,,- CCS, 
AmtB 5821.3 + 0.03 [+13] 1U7G (1.40) 4841.1 5833.1 -0.2 [+13] 
5872.6 + 0.12 [+14] +0.7 [+14] 
5893.8 + 0.10 [+15] +1.0 [+15] 
5910.6 + 0.09 [+16] +1.3 [+16] 
6014.6 + 0.89 [+17 +3.1 [+17 
AqpZ 5034.7 + 0.13 [+11] 1RC2 (2.50) 4239.4 4970.8 +1.3 [+11] 
5101.5 + 0.05 [+12] +2.6 [+12] 
5100.6 + 0.05 [+13] +2.6 [+13] 
5092.1 + 0.05 [+14] +2.4 [+14] 
5124.3 + 0.13 [+15] +3.1 [+15] 
MscL 4582.4 + 0.61 [+11] 20AR (3.50) 4081.0 5438.5 -15.7 [+11] 
4587.5 + 0.51 [+12] -15.6 [+12] 
4605.0 + 0.45 [+13] -15.3 [+13] 
4715.2 + 0.22 [+14 -13.3 [+14 
d 
Protein Detergent Construct Monomer Measured Monomer Measured Oligomer Stoichiometry to (ms) 
mass (Da) mass (Da) mass (Da) 
AmtB C8E4 42278 42244 + 13 126719 +8 3 0.97 
NG 42238 + 3 126705 + 46 3 0.85 
DDM 42225 + 52 126729 + 10 3 0.75 
OGNG 42226 +9 126725 + 10 3 1.00 
AqpZ C8E4 24713 24720 + 1 98891 + 29 4 0.97 
NG 24722 +5 98886 + 36 4 1.02 
DDM 24729 + 48 98890 + 147 4 0.89 
OGNG 24722 +3 98936 + 23 4 0.92 
MscL C8E4 17034 17130 +2 85578 + 13 5 1.06 


Extended Data Figure 1 | Maintaining intact native membrane protein 
complexes in the mass spectrometer. a, b, Gas-phase unfolding plots (left, 5 V 
steps) and mass spectra (right) of detergent stripped AqpZ and AmtB ions 
(charge inset) from different non-ionic detergent solutions; CgE, (pink), NG 
(green), OGNG (purple) and DDM (orange). Membrane protein complexes 


from NG, OGNG and DDM possessed CCS values substantially larger than 
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those calculated from crystal structures. c, Complete removal of CgE, at 

low-collision voltages reveals CCS values consistent with those calculated from 
their respective crystal structures. d, Reported are measured masses, standard 
deviations, and empirical fy values used for direct CCS calculation of membrane 
proteins studied. 
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Average 
Phospholipid Name Abbreviation Molecular Phospholipid Head Group 
Weight 
O 
1,2-dinervonoyl-sn-glycero-3- aN, _p. @ 
phosphocholine (24:1 Cis) aye 954.4 * oF O--N< 
I 
1,2-dierucoyl-sn-glycero-3- 
phosphocholine (22:1 Cis) DEPG 898.3 
1,2-dieicosenoyl-sn-glycero-3- 
phosphocholine (20:1 Cis) DIPC aes 
1,2-dioleoyl-sn-glycero-3- 
phosphocholine (18:1 Cis) DOPC eee 
1,2-dipalmitoleoyl-sn-glycero-3- 
phosphocholine (16:1 Cis) DERG 730.0 
1,2-dimyristoleoyl-sn-glycero-3- 
phosphocholine (14:1 Cis) PMEC 73-2 
1-palmitoyl-2-oleoyl-sn-glycero- 
3-phosphocholine (16:0-18:1) FC BO" 
9 {4° 
1-palmitoyl-2-oleoyl-sn-glycero- 2 So-P- 4 
3-phospho-L-serine (16:0-18:1) PS cen 0 ae O NH 
(S) ® 
2 OH 
L-a-phosphatidylglycerol (E. coli) PG 761.1 *o-P-o_ A_LOH 
O 
} 
fe) 
1-palmitoyl-2-oleoyl-sn-glycero- *0-P-0OH 
3-phosphate (16:0-18:1) nn e702 of 
io) 
2 
L-a-phosphatidylethanolamine aN -P- ® 
(E. coli) PE 719.3 oR O-*NH, 
(} 
H 
1,2-dioleoyl-sn-glycero-3- OHO Ong OH 
phospho-(1'-myo-inositol-3'- Pl 977.1 *O-P-o OH 
phosphate) (18:1) (@) 
io} O é 
ORO 
0. 
7 HO~LH 
Cardiolipin (E. coli) CDL 1430.0 O 
io} 
b 
Protein Phospholipid Protein Phospholipid Detergent Ratio (P:L:D) # Bound Lipid mass 
Oligomer (uM uM M addition (Da 
AqpZ CDL 7 70 16300 1:10:2300 1 1378.5 412.5 
PE 7 348 - 1:50:2300 1 718.2 + 18.5 
PG 9 87 - 1:10:1757 3 767.6 + 11.1 
PS 9 383 - 1:40:1757 3 785.5 + 25.6 
PC 7 132 - 1:18:2300 3 762.3 + 14.0 
PA 7 449 - 1:61:2300 2 658.2 + 8.0 
AmtB CDL 3 140 16300 1:45:5197 1 1417.9 + 68.1 
PE 2 70 - 1:44:10395 2 688.7 + 29.1 
PG 4 99 - 1:28:4559 4 756.9 + 20.8 
PS 4 92 - 1:26:4559 3 759.9 + 22.9 
PC 4 95 - 1:27:4559 3 756.5 + 20.9 
PA 2 69 - 1:44:10395 3 664.5 + 14.0 
AmtBN72A/N794 PG 6 131 - 1:21:2558 3 763.1 + 21.3 
MscL CDL 17 175 8157 1:10:480 1 1430.0 + 41.7 
PE 17 70 - 1:4:480 1 705.5 + 25.8 
PG 17 131 - 1:8:480 2 778.4 + 39.4 
PS 17 128 - 1:7:480 2 794.8 + 22.0 
PC 17 132 - 1:7:480 3 776.4 + 29.7 
PA 17 143 - 1:8:480 4 672.7 + 36.5 
Pl 17 52 - 1:6:480 4 957.3 + 13.8 
DNPC 24 79 - 1:3:350 2 967.5 + 29.7 
DEPC 24 56 - 1:2:350 2 913.6 + 13.9 
DIPC 24 89 - 1:4:350 2 830.2 + 52.9 
DOPC 24 95 - 1:4:350 2 793.7 + 22.2 
DPPC 24 137 - 1:6:350 2 752.9 + 52.2 
DMPC 24 74 - 1:3:350 2 695.6 + 54.9 


Extended Data Figure 2 | Phospholipid abbreviations and optimization of masses. Conditions were optimized empirically to maintain nanospray, 
phospholipid binding experiments. a, Phospholipid abbreviations and sufficient mass spectral quality and phospholipid binding. Masses for one lipid 
headgroup structures. b, Protein to phospholipid to detergent ratios (P:L:D) for __ bound to the protein complex were measured using MassLynx V4.1 software 
each membrane protein, number of resolvable phospholipids bound with their (Waters). 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 
a 7) a 
= = — 
vo ov vo 
£ £ £ 
- F 2 - 
3 ro ro 
2 2 2 
< z 10 a 10 
00 100 
Fa = = 
a7) o a7) 
c c c 
a vo a 
E E. E 
oe v v 
> > > 
r= S SB 
S s Ss 
ov oO a 
4 4 "4 
ry a ry 
£ = E 
ov ov vo 
£ £ E 
- F 12 - 
g s 2 
< 10 < 
1 100 1 
z z= = 
a a a 
| c J 
a vo a 
2 2 g 
:= =, =, 
$ g g 
=] S P=) 
2 s i 
ov oO 7 
4 4 « 
b 
E E E 
ov ao a 
£ £ £ 
- - - 
ir ir 3 
2 2 2 
< < 10 < 
100 
z = = 
£ z £ 
i = c 
wo wo ov 
= £ £, 
vo vo a 
= > = 
BS Ss = 
8 8 Ss 
ov ou ov 
[4 ca a 
9000 
E = E 
ov o — 
E E : 
E wv i e 
= g 8 
< 10 < < 
00 
r= = > 
z g 2 
8 g 5 
= £, =, 
o wo a 
Fl 2 > 
ia 8 ia 
& Z 2 
7000 9000 
m/z 
c 
= 
2 
= 
fo} 
=] 
8 
= 
8 
a) 
14 16 18 20 22 24 
Chain Length 
Extended Data Figure 3 | Phospholipid binding to MscL reveals an and ion mobility spectra of MscL bound to phospholipids of varying alkyl chain 


insignificant impact on protein gas-phase stabilization independent of lipid —_ length. c, Cumulative stabilization of MscL is seen for one (blue) to two (red) 
alkyl chain length. a, Representative mass and ion mobility spectra of MscL _ bound lipid molecules. Reported are the means + s.e.m. from repeated 
bound to various phospholipids species (inset top right). b, Representative mass | measurements (n = 3). 
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a c 
Ratio Ratio Ratio 
Lipid ccs Standard ccs Standard ccs Standard 
P increase Deviation increase Deviation increase Deviation 
1 Lipid 2 Lipid 3 Lipid 
AqpzZ 
PS 1.0039 0.0023 1.0103 0.0023 1.0146 0.0019 
PG =—-1,0043 0.0009 1.0113 0.0015 1.0158 0.0008 
PE —-:1,0046 0.0016 
PC 1.0045 0.0027 1.0092 0.0018 1.0143 0.0029 
PA 1.0044 (0.0014 1.0095 0.0018 
CDL 1.0117 0.0020 
AmtB 
PS 1.0031 0.0023 1.0057 0.0021 
PG =—‘1.0031 0.0026 1.0054 0.0020 1.0084 0.0021 
PE —- 1.0018 0.0012 1.0038 0.0028 
1/20th of lipids shown 
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system my PC [#] ry POPC Bound CCS Calculations? ie pt Ge a pense 
CDL 1.0060 0.0047 
AqpZ 27,288 264 499.6 1 206,000 
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4 Se ees PS =—-1.0078 «0.0128 
AmtB 28,204 214 479.2 1 226,359 PG =—-1,0057 0.0117 
F 1835708 PE —-:1.0056 0.0040 
PC 1.0062 0.0084 1.0156 0.0074 1.0220 0.0069 
MscL 14,807 96 500.0 1 161,941 
PA 1.0050 0.0094 
mi027p86 CDL 1.0129 0.0051 
‘excluding salventietoms: Pl 1.0099 0.0053 1.021 0.0055 1.029 0.0015 
‘Snapshots from simulations were extracted every 0.2 ns. 
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Extended Data Figure 4 | Summary of statistics for molecular dynamics see Methods. b, Summary of statistics. c, Ratios of phospholipid bound to apo 
(MD) simulations followed by CCS filtering. a, Representative filtering CCS values for membrane protein complexes. Reported are the mean and 
procedure of AqpZ in a PC bilayer from the MD (top). For each frame, lipids _ standard deviation (n = 3). d, MD filtered by CCS resulted in similar patches of 
are extracted within 6 A of the protein (bottom left) then filtered by the PC molecules across different time points within the simulation indicating the 
experimental CCS of AqpZ bound to a single PC molecule (bottom right); systems were equilibrated. 
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Extended Data Figure 5 | MD simulations filtered by CCS reveals probable 
one (1X) and two (2X) phospholipid binding sites. a, For each candidate 
structure, protein and 1X or 2X PC molecule(s), the ratio of their calculated 
CCS values was determined (CCS ratio). This procedure generated a large 
number of candidate structures (grey bars) that were filtered using our CCS 
measurements (cyan line). The structures in grey that intercept this curve are 
essentially the ones selected as our most probable ensemble. b, The intersection 
between the simulated lipid complexes and the experimental data are then 
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projected onto the surface of the protein to identify the most probable binding 
sites. Probable lipid locations for MscL resembled an annular belt, with no 
specific patches of lipids probably stemming from the relatively cylindrical 
geometry of this complex. By contrast, for AqpZ and AmtB the most probable 
location of the lipid molecules were localized to the interfacial regions between 
protein subunits, as well as other probable locations on individual monomers. 
c, X-ray derived PG (blue spheres with white tails) located at the subunit 
interfaces agrees with the predicted PC binding site. 
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Extended Data Figure 6 | Modelling and quantification of gas-phase 
unfolding pathways. a, Representative ion mobility mass spectra are collected 
over a range of collision voltages. b, Ion arrival time is converted to CCS before 
generating unfolding plots. c, Model fitting process from 2D to 3D data (see 
Methods). d, Both a Synapt2 modified with a linear drift cell (DT-IMS) and 
the commercially available travelling wave SynaptG2 (TW-IMS) produce 
qualitatively similar unfolding data. e, A contour plot representing the variance 
of CCS of two gas-phase unfolding species as a function of ion mobility drift cell 


potential. f, Stacked plots of arrival time distributions for two gas-phase 
unfolding species as a function of drift cell potential. The lifetime of unfolding 
protein complexes in the drift tube ranges from 4 to 15 ms depending on the 
drift cell potential. No additional unfolding post activation occurs implying that 
the unfolding mechanism is not consistent with an irreversible unfolding 
model. Such a mechanism would predict time dependence on the population of 
unfolded species. By contrast the unfolding mechanism is well described by the 
reversible unfolding mechanism (see Methods). 
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Extended Data Figure 7 | AqpZ, AmtB and AmtBN”?“/ bound to of AqpZ bound to phospholipids. b, Representative mass and ion mobility 


various phospholipid species. a, Representative mass and ion mobility spectra spectra of AmtB bound to phospholipids and AmtBN’*4/N”°4 bound to PG. 
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Total Polar Lipid Extract AqpZ Kwat (S") Standard Error 


: EPL + 92.6 a7 
BKT22 1 = 
; > : BKT22 + 35.6 5.6 
a ; ed 
— *» s ; BKT22-YC + 82.9 5.1 
mol % EPL - 1.2 0.023 
Lipid Extract | PE PG CDL 
BKT22-YC | 71.7 21.7 6.6 BKT22 = 0.4 0.005 
BKT22 74.2 25.8 = 
EPL sao) ano. Bf BKT22-YC - 0.5 0.011 
Extended Data Figure 8 | Summary of water permeability assays and quantified by densiometry. b, Reported are the rate constants (k,,a.) and 


analysis of lipid extracts. a, HPTLC analysis of total polar lipid extract from _ standard error of replicates (n = 5) for empty liposomes (—) and AqpZ 
wild-type E. coli (EPL), cardiolipin-deficient strain (BKT22), or BKT22 cells proteoliposomes (+) reconstituted in differing E. coli lipid compositions. 
expressing ClsC and YmdB (BKT22-YC) to restore cardiolipin. Lipids were 
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a 

b 
Extended Data Figure 9 | Structural analysis of AmtB bound to PG. AmtB bound to PG (green chain, this work) aligned with the AmtB structure 
a, Crystal packing with six AmtB (multicoloured) and eight PG (orange) (maroon chain, PDB: 1U7G). d, Structure overlay of AmtB-GlnK complex 
molecules located in the asymmetric unit cell and symmetry related molecules _ bound to octylglucoside (purple chain, PDB: 2NS1) aligned with AmtB bound 
shown in grey and light orange, respectively. b, F,-F. and 2F,-F, electron to PG reveals a distinct conformational change. The lipid—water interface 
density maps after refinement without lipid and near-lipid water molecule (grey plane) was determined from coordinates of phosphate atoms from 
(if present) contoured at 2.0 and 1.0 sigma, respectively. c, Comparison of bound PG. 
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Extended Data Table 1| Summary of X-ray data collection and 
refinement statistics 


Data collection 
Space group 
Cell dimensions 


AmtB bound to PG 


C222; 


a, b,c (A) 116.2, 201.2, 232.5 
a, By (°) 90, 90, 90 
Resolution (A) 40-2.3 (2.4-2.3)° 
Rinerge (Jo) 12.4 (120.4) 
I/ol 13.7 (2.5) 
Completeness (%) 99.9 (99.5) 
Redundancy 11.0 (6.9) 


Refinement 


Resolution (A) 38.7-2.3 (2.36-2.3) 
No. reflections 119,766 
Ryork! Riree 20.2 / 23.48 
No. atoms 
Protein 15,856 
Ligand/ion 293 
Water 111 
B-factors 
Protein 45.3 
Ligand/ion 64.7 
Water 29.5 
R.m.s. deviations 
Bond lengths (A) 0.016 
Bond angles (°) 1.71 


“Values in parentheses are for highest-resolution shell. 


, Rye calculated using 5% of the data. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature13431 


Corrigendum: Realizing the promise 
of cancer predisposition genes 
Nazneen Rahman 


Nature 505, 302-308 (2014); doi:10.1038/nature12981 


In Figure 1 of this Article, the gene RET was incorrectly repeated on 
chromosome 12 and the genes BRIP1 and TRIM37 were omitted from 
chromosome 17. We thank Eric Song from Nanjing University, China, 
for bringing these inconsistencies to our attention. Figure 1 has been 
corrected online. 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature13463 


Erratum: Recent advances in 


homogeneous nickel catalysis 
Sarah Z. Tasker, Eric A. Standley & Timothy F. Jamison 


Nature 509, 299-309 (2014); doi:10.1038/nature13274 


In the first paragraph of this Review, the words ‘such as facile oxida- 
tive addition and ready access to multiple oxidation states’ were inad- 
vertently repeated in the print version. The paper is correct online. 
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Amove from China to 
America pays off for epigeneticist p.179 
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The band Bad Religion is fronted by evolution researcher Greg Graffin. 


SCIENCE AND THE ARTS 


Rock and researc 


Scientists who moonlight as musicians get more from a gig than a fistful of cash. 


BY KAREN KAPLAN 


. yates are used to facing the music. 


Failure stares them down every time their 
grant is denied, their paper is rejected, their 
fellowship is declined or their experiment flops. 
But many deal with the anguish of figura- 
tively facing the music by literally making it. 
Although no formal statistics document the 
number of researchers who are professional or 
amateur musicians, anecdotal evidence sug- 
gests that they are legion. Those who cultivate 
a second life as musicians say that the misery 
of failed experiments, vetoed grant applications 
and rebuffed manuscripts falls away whenever 
they pick up their guitar, trumpet or drumsticks. 
One might wonder why early-career 
researchers — who typically work 70-80-hour- 
plus weeks — spend what little spare time they 
have rehearsing, performing and recording 


music when they could be, say, sleeping, or 
spending that time with their family. In part, 
say scientist-musicians, it is because performing 
live onstage or setting down tracks in the studio 
is akin to a powerfully addictive drug: it gives 
them such a rush that they must keep coming 
back for more. Indeed, research has confirmed 
that creating, performing and listening to music 
produces dopamine, the same neurochemical 
released during sex and other such pleasur- 
able experiences (V. N. Salimpoor et al. Nature 
Neurosci. 14, 257-262; 2011). But scientist- 
musicians also say that making music helps 
them to unwind and recalibrate. Rumination, 
recrimination or self-flagellation about what 
went wrong with the research programme or 
why the tenure application was turned down 
disappears when they focus on the next note or 
chord and the audience's response. 

It has another advantage, too. For many who 
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pursue music, it feeds and bolsters their science. 
Freeing their mind from the tightly structured 
rigours of an experiment and the tedium of data 
collection to wander through fields of melodies 
and measures nurtures their scientific creativity. 
Days, even hours, after a particularly satisfying 
performance, writing stint or rehearsal session, 
a researcher often finds that she or he can sud- 
denly devise a better way to approach a study, 
develop a stronger hypothesis for a manuscript 
or come up with a different rationale for a grant. 

There is, of course, another, more fundamen- 
tal reason for the time sacrifice: performing 
music onstage, writing and even rehearsing it 
is great fun. 

“You get obsessed with it,” says Joseph 
LeDoux, a neuroscientist at New York Uni- 
versity and singer-songwriter and guitarist 
for blues-rock band The Amygdaloids, who 
perform at clubs and other venues and have 
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> recorded two original CDs and an extended 
play. “A lot of what I do for the band is write, 
and I’m a decent writer. It’s not the same kind of 
writing as a paper or a grant, butit’s freeing and 
releasing to write that way.” 

The immediate audience feedback is a huge 
part of live music’s allure and a counterpoint to 
science’ slower pace, says Cliff Schweinfest, sci- 
entific review officer for the US National Cancer 
Institute in Rockville, Maryland, and trumpet 
player for the NIH Community Orchestra, a 
volunteer group of classical and Renaissance 
musicians whose day jobs are with the US 
National Institutes of Health (NIH) and other 
federal agencies. “Music gives you an instanta- 
neous pay-off, he says. “That's very satisfying 
— it'sa good complement to science, which has 
a much slower pace.” 

The music-science interplay can have real 
benefits. One study, for instance, has found that 
people who form companies and file patents 
are much more likely to be involved in the arts 
than are those who have never done these things 
(R. LaMore et al. Econ. Dev. Q. 27, 221-229; 
2013). Long-term participation may enhance 
creative potential in science and technology, 
and scientists who are musicians are likely to be 
highly successful, publishing provocative papers 
or producing many patentable inventions, says 
psychologist Robert Root-Bernstein from 
Michigan State University in East Lansing, and 
an author of the study. “They are more success- 
ful by any set of criteria, including publishing 
and grants,’ he says. “They are more innovative. 
A lifelong exposure to music helps to foster a 
more creative approach to work” 


BLURRED BOUNDARIES 

So tightly intertwined are science and music 
for evolutionary researcher Greg Graffin, who 
fronts the punk-rock outfit Bad Religion in 
Los Angeles, California, that the lines between 
them often blur. Lead singer and songwriter for 
the band since he was 15, Graffin perceives a 
profound connection between the anti-estab- 
lishment, alienative nature of punk music and 
the demands of scientific research for rational 
thinking and for challenging existing doctrine. 
The mindset has piloted both his science and his 
music for the past 35 years. “That was the way 
I was thinking when I was 16 — to challenge 
authority to make headway in this world,’ he 
says. “It’s worked for me in punk, and it’s what I 
do asa scientist.” Of the four books he has pub- 
lished, Anarchy Evolution: Faith, Science, and 
Bad Religion in a World Without God is perhaps 
the most iconoclastic, examining the conflict 
between religion and science. 

Hugely successful as a punker — Bad Reli- 
gion headlined iconic clubs such as Whiskey a 
Go-Go in West Hollywood, California, when 
he was 17 — Graffin slowed his research career 
for a while, fitting in a doctorate, a postdoc and 
stints at the bench between tours and records. 
But he could never bring himself to leave sci- 
ence completely for music, lucrative though his 
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Punk band the Descendents release albums around research and family commitments. 


music career has been. He is now a lecturer in 
evolution at Cornell University in Ithaca, New 
York, and is preparing to publish a book on 
the biological basis of species coexistence. He 
teaches and conducts research in the autumn 
and spring, and reserves summers for touring, 
writing and recording — because even now, 
decades later, music still holds a seductive allure. 

“To this day, there’s nothing more rewarding 
than playing concerts and just seeing the peo- 
ple mouth the words and sing along,” he says. 
“There's such a dichotomy — the lights, the 
stage, the sensation of the moment — and the 
inverse of that, the quiet solitary work of data 
collection.” 

Although his music and his science are 
closely knit in his mind, Graffin tries to keep 
them separate, espe- 


cially for his students, “There ’ssucha 
in an effort to pre- dichotomy — the 
serve his scientific lights, the stage, 
identity. “Inalecture, the sensation of 
Idon'teventalkabout the moment — 
music, but [students] and the inverse 
come up after class” of that, the quiet 
he says. “Idon't want solitary work of 


to promote that part datacollection.” 


of my life — in a sense 
they're indulging me, to see me as a scientist. 
I'm walking the line between the two worlds.” 
Plant geneticist Milo Aukerman also strad- 
dles an oft-indistinct boundary between 
research and music. Long-time frontman and 
lead singer for Los Angeles-based early pun- 
kers the Descendents, Aukerman followed 
a path similar to Graffin’s, sandwiching his 
graduate programme and postdoc between 
tours and recording sessions. After completing 
his postdoc in 1999 — it took six years, thanks 
to a series of leaves for touring and recording 
— Aukerman joined DuPont in Wilmington, 
Delaware, in 2001. Then music came knock- 
ing again. “I hadn't been at DuPont for more 
than a year when the band called. I said, ‘I can’t 


© 2014 Macmillan Publishers Limited. All rights reserved 


tour, and they said, “How about a record?; and 
I said ‘I could squeeze that in,” he recalls. The 
CD was released in 2004 to significant acclaim, 
but Aukerman had decided it was time to focus 
again on his science — until 2010, when the 
muse called him back once more. Now, he joins 
the band for weekend shows and brief interna- 
tional tours that he fits in during holidays. 


COMPLEMENTARY CURES 

Music and science, he says, are different sides 
of the same coin — they both draw on creative 
energies and abilities. “From the very beginning 
of my interest in biology, it was about what you 
could do. This was a raw substance, this was 
like something you sculpted, that you could 
create with,” Aukerman says. “It’s the same way 
in music. Having innovated — writing a song, 
interpreting a riff in a different way — brings 
the same exhilaration that you get from science.” 

And the research can inform the music. The 
Amygdaloids’ original tunes are about the brain 
or the mind in some way — and LeDoux finds 
that the music affects his work in a positive fash- 
ion. “I've got much better at being a performer 
—Tm more relaxed now, more spontaneous, 
and that also helps flipping back to science — 
I'm more willing to free-associate,’ he says, 
adding that being a rocker has improved his 
performance on the podium when he gives 
talks. “During a science lecture, you might be 
in your own mind, facing the screen, looking for 
the next slide, but with music, you've got to be 
right there — onstage as a musician, it’s all about 
grabbing the audience. And I’ve become much 
more conscious of that in my work? 

For some, the music offers more direct 
insights: it is part and parcel of the research 
itself. Thomas Scelo, an acoustics consultant 
and classical guitarist, knows that the connec- 
tion between his music and his research is sub- 
stantive, even crucial. The director of Marshall 
Day Acoustics in Hong Kong, Scelo works with 
cities, private companies, architects and others 
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